You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Jan 26, 2025. It is now read-only.
We need complete end-to-end documentation for a single-node dstlr:
Ingesting Washington Post into Solr.
Running extraction on a subset of the docs. (I understand that extraction over the entire corpus might be unrealistic on a single node.)
Running enrichment.
Running sample data cleaning queries.
We have parts here and there already, but I'd like documentation down to the level of "copy and paste these commands" into a shell... and it should just work.
The text was updated successfully, but these errors were encountered:
I've started a branch here for the updated documentation. I've added the instructions to build dstlr, fix an issue with CoreNLP 3.8 and Spark, added the Anserini/Solrini instructions, and updated some neo4j docs.
@x389liu Are you able to flush out more of the Running section? It might be good to point out what needs changing in each of the scripts (e.g., the neo4j password, amount of memory, # executors and # cores, etc.)
Talked to @r-clancy. Before we close this issue, we'd like to run dstlr on a single himrod node following these instructions, check if more details are needed.
We need complete end-to-end documentation for a single-node dstlr:
We have parts here and there already, but I'd like documentation down to the level of "copy and paste these commands" into a shell... and it should just work.
The text was updated successfully, but these errors were encountered: