Made changes to Makefile #10

Merged
merged 6 commits into from
Mar 8, 2022

shawntanzk
Collaborator

  • remove reason
  • add download sources

Current error:

robot extract --method BOT -i mirror/uberon.owl -T terms.txt remove --axioms disjoint -o uberon_slice.owl
MISSING TERMS ERROR term(s) are required with --term or --term-file
For details see: http://robot.obolibrary.org/errors#missing-terms-error
Use the -vvv option to show the stack trace.
Use the --help option to see usage information.
make: *** [Makefile:71: uberon_slice.owl] Error 1

@shawntanzk
Collaborator Author

Guessing new UBERON mirror has some changes and something that is being extracted in terms.txt is missing?

@shawntanzk
Collaborator Author

just looked at terms.txt - don't understand this part:

it is taking all terms for sources_merged which is all the allen files, and extracting terms from it - these are all dhba, mba etc. terms
then we try to use terms.txt to extract from uberon which does not have these terms?
Did uberon use to merge these files in? Is that why it is uberon?

@shawntanzk
Collaborator Author

@dosumis - might need help with this

@shawntanzk
Collaborator Author

Looking at tmp.owl -> is the aim of uberon_slice to extract out uberon terms in sources_merged.owl? In which case, the sparql query terms.sparql should have a filter to grab only uberon terms?

@shawntanzk
Collaborator Author

tested the above and came up with the same issue

@matentzn

matentzn commented Mar 3, 2022

@shawntanzk I can help you with this as well.

Generally, the seeds we use for extraction contain all classes in our ontologies, regardless of where they should be imported from. But of course, most of the classes will be ignored by robot extract - it will only import those classes that are actually present in the mirror.
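A rough way to picture this behaviour is a plain set intersection between the seed and the classes present in the mirror. This is only an illustration, not how robot extract is implemented (BOT extraction also pulls in related axioms and superclasses); the function name and the `example.org` IRI are made up for the sketch.

```python
def importable_terms(seed, mirror_classes):
    """Return the seed terms that the mirror actually contains.

    Terms missing from the mirror are simply skipped, which mirrors the
    idea that a seed may list classes from many source ontologies.
    """
    mirror = set(mirror_classes)
    return sorted(term for term in set(seed) if term in mirror)


# Hypothetical seed: one UBERON class plus one non-UBERON class.
seed = [
    "http://purl.obolibrary.org/obo/UBERON_0001966",
    "https://example.org/DHBA_0000001",  # made-up IRI, for illustration only
]
# Hypothetical list of classes found in the UBERON mirror.
mirror_classes = [
    "http://purl.obolibrary.org/obo/UBERON_0001966",
    "http://purl.obolibrary.org/obo/UBERON_0000955",
]

print(importable_terms(seed, mirror_classes))
```

Under these assumptions only the UBERON class survives, so a UBERON-only filter in the SPARQL query should not be needed for the extract to work.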

What issue are you trying to address here?

@shawntanzk
Collaborator Author

OK, we found the problem: the terms file wraps each IRI in < > (e.g. <http://purl.obolibrary.org/obo/UBERON_0001966>).
robot extract doesn't seem too happy about this, because when I remove the < > from terms.txt manually it works.
I currently have a filter to only take out uberon terms, but I guess that isn't needed given extract ignores classes that aren't in the mirror. Will just add a command to remove the < >, ugur is helping me with it now :) thanks
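For reference, the clean-up step could look something like the sketch below. It assumes terms.txt holds one IRI per line, possibly wrapped in angle brackets (the SPARQL TSV results format writes IRIs that way, while the CSV format writes them bare). The function name is hypothetical and this is not the command that ended up in the Makefile.

```python
def strip_iri_brackets(lines):
    """Return sorted, de-duplicated terms with any enclosing <...> removed."""
    terms = set()
    for line in lines:
        term = line.strip()
        # Only strip when the whole term is wrapped, e.g. <http://...>.
        if term.startswith("<") and term.endswith(">"):
            term = term[1:-1]
        if term:  # skip blank lines
            terms.add(term)
    return sorted(terms)


raw = [
    "<http://purl.obolibrary.org/obo/UBERON_0001966>\n",
    "http://purl.obolibrary.org/obo/UBERON_0001966\n",
    "\n",
]
print(strip_iri_brackets(raw))
```

Sorting and de-duplicating in the same pass plays the role of `sort | uniq` in a shell pipeline.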

@matentzn

matentzn commented Mar 3, 2022

Consider how ODK does it:

$(PRESEED): $(SRCMERGED)
	$(ROBOT) query -f csv -i $< --query ../sparql/terms.sparql $@.tmp && \
	cat $@.tmp | sort | uniq > $@

Using the -f csv parameter!

@shawntanzk
Collaborator Author

shawntanzk commented Mar 3, 2022

perfect, that saves the sed command :) thanks @matentzn
btws is this a bug that it can't deal with tsv? or like is it meant to be that way

@shawntanzk
Collaborator Author

shawntanzk commented Mar 3, 2022

next error:

  File "/usr/local/lib/python3.8/dist-packages/pandas/io/excel/_openpyxl.py", line 48, in __init__
    from openpyxl.workbook import Workbook
ModuleNotFoundError: No module named 'openpyxl'
make: *** [Makefile:98: report.xlsx] Error 1

Guess that is just not on ODK container

@matentzn

matentzn commented Mar 3, 2022

Yes, if you want to run this inside of ODK (@dosumis did not when he wrote the Makefile), then you need to install that dependency. This is how you can do that:

https://github.com/OBOFoundry/COB/blob/master/src/ontology/cob.Makefile#L86

But it is annoying, I grant you that. I would probably exclude report.xlsx from make all for now, and then ask David if he still needs it. If so, add a goal that installs the dependencies like in the example above and run it just before, i.e.

sh run.sh make dependencies all
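As a complement to a dependencies goal, the report script could check up front that the modules it needs are importable and fail with a clearer message than the traceback above. This is a minimal sketch using only the standard library; the function name and the list of required modules are assumptions, not taken from the actual report script.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names that cannot be found for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# Assumed requirements for writing report.xlsx with pandas.
required = ["pandas", "openpyxl"]

missing = missing_modules(required)
if missing:
    print("Missing Python modules:", ", ".join(missing))
    print("Install them first, e.g. via the dependencies goal.")
else:
    print("All report dependencies are available.")
```

`find_spec` only looks the module up without importing it, so the check itself stays cheap.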

@shawntanzk
Collaborator Author

report is important, I figured I'll just run on my local machine, but I guess it's better to add the dependencies into the makefile so others can run it

@shawntanzk
Collaborator Author

yay full run! with a lot of help from a lot of people lol.
Looks like a lot of diff though, will look through a bit to see if it is just rearrangement or something more that we should look into.

@shawntanzk shawntanzk marked this pull request as ready for review March 7, 2022 11:47
@shawntanzk shawntanzk changed the base branch from master to issue_9 March 8, 2022 13:23
@shawntanzk shawntanzk changed the base branch from issue_9 to master March 8, 2022 13:24
@shawntanzk shawntanzk changed the base branch from master to issue_9 March 8, 2022 13:26
@shawntanzk shawntanzk merged commit 7f5ea6c into obophenotype:issue_9 Mar 8, 2022
@shawntanzk shawntanzk deleted the download-sources branch March 8, 2022 13:31

2 participants