Downloads of BioMart annotation tsv file happens in incorrect format #22

Open
suhaibMo opened this issue Apr 30, 2019 · 0 comments
suhaibMo commented Apr 30, 2019

In the E196 E!G43 and WBPS12 update, we noticed that the Scala script src/pipeline/Start.sc reports a successful download, but for a few plant species the file contents were corrupted.
The affected files contained HTML/XML tags instead of the expected TSV annotation data:

brassica_napus.ensgene.interpro.tsv
hordeum_vulgare.ensgene.go.tsv
oryza_indica.ensgene.go.tsv
oryza_sativa.ensgene.interpro.tsv
physcomitrella_patens.ensgene.interpro.tsv
solanum_tuberosum.ensgene.interpro.tsv

The code didn't break because the downloaded file exists, so the corruption went unnoticed.

Solution.
We need a way to verify the format/content of each downloaded file, for example by checking the downloaded files for any HTML tags.
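
A minimal sketch of such a check, assuming Scala 2.13+ and only the standard library. `TsvCheck`, `validateLines`, `validateFile` and the `expectedColumns` parameter are hypothetical names for illustration and are not part of `src/pipeline/Start.sc`. The markup regex is a heuristic, and treating an empty file as invalid is also an assumption:

```scala
import java.nio.file.Path
import scala.io.Source
import scala.util.Using

// Hypothetical helper, not taken from the existing pipeline code.
object TsvCheck {
  // Heuristic: matches "<?xml", "<!doctype", or a tag such as <html>, </p>, <a href="x">.
  private val markup = """(?i)<(\?xml|!doctype|/?[a-z][a-z0-9]*(\s[^<>]*)?/?>)""".r

  def looksLikeMarkup(line: String): Boolean =
    markup.findFirstIn(line).isDefined

  /** Right(()) if every line looks like TSV, Left(reason) for the first problem found. */
  def validateLines(lines: Iterator[String], expectedColumns: Int): Either[String, Unit] =
    if (!lines.hasNext) Left("file is empty") // assumption: an empty download is an error
    else
      lines.zipWithIndex.collectFirst {
        case (line, idx) if looksLikeMarkup(line) =>
          s"line ${idx + 1} contains HTML/XML markup"
        // limit -1 keeps trailing empty fields, so "gene\t" still counts as two columns
        case (line, idx) if line.split("\t", -1).length != expectedColumns =>
          s"line ${idx + 1} does not have $expectedColumns tab-separated columns"
      }.toLeft(())

  /** Reads the file as UTF-8 and validates it; the source is closed afterwards. */
  def validateFile(path: Path, expectedColumns: Int): Either[String, Unit] =
    Using.resource(Source.fromFile(path.toFile, "UTF-8")) { src =>
      validateLines(src.getLines(), expectedColumns)
    }
}
```

The download step could call `TsvCheck.validateFile` right after writing each file and fail, or retry, on a `Left`, instead of relying on the file merely existing. The column count per annotation type would have to come from the BioMart query that produced the file.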
