Complete end-to-end documentation for single-node dstlr #20

lintool · 2019-10-02T22:09:02Z

We need complete end-to-end documentation for a single-node dstlr:

Ingesting Washington Post into Solr.
Running extraction on a subset of the docs. (I understand that extraction over the entire corpus might be unrealistic on a single node.)
Running enrichment.
Running sample data cleaning queries.

We have parts here and there already, but I'd like documentation down to the level of "copy and paste these commands" into a shell... and it should just work.

ryan-clancy · 2019-10-02T22:35:36Z

I've started a branch here for the updated documentation. I've added the instructions to build dstlr, fix an issue with CoreNLP 3.8 and Spark, added the Anserini/Solrini instructions, and updated some neo4j docs.

@x389liu Are you able to flush out more of the Running section? It might be good to point out what needs changing in each of the scripts (e.g., the neo4j password, amount of memory, # executors and # cores, etc.)

x389liu · 2019-10-02T22:44:12Z

@r-clancy yeah, I'll add more details to that branch.

x389liu · 2019-10-09T21:00:30Z

@lintool ryan and I have added detailed instructions on running single-node dstlr #26
I think this issue can be closed?

x389liu · 2019-10-09T21:12:58Z

Talked to @r-clancy. Before we close this issue, we'd like to run dstlr on a single himrod node following these instructions, check if more details are needed.

lintool · 2020-02-11T15:27:11Z

Bumping this - @x389liu you should work on this.
The Core18 instructions in the README can now just be replaced by the Solrini docs in Anserini.

lintool assigned ryan-clancy and x389liu Oct 2, 2019

lintool mentioned this issue Oct 2, 2019

Missing documentation about ingesting data #13

Closed

x389liu mentioned this issue Oct 9, 2019

Update README.md and running scripts #26

Merged

x389liu mentioned this issue Feb 26, 2020

Add an option to extract triples from only top n hits #30

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Complete end-to-end documentation for single-node dstlr #20

Complete end-to-end documentation for single-node dstlr #20

lintool commented Oct 2, 2019

ryan-clancy commented Oct 2, 2019

x389liu commented Oct 2, 2019

x389liu commented Oct 9, 2019

x389liu commented Oct 9, 2019

lintool commented Feb 11, 2020

Complete end-to-end documentation for single-node dstlr #20

Complete end-to-end documentation for single-node dstlr #20

Comments

lintool commented Oct 2, 2019

ryan-clancy commented Oct 2, 2019

x389liu commented Oct 2, 2019

x389liu commented Oct 9, 2019

x389liu commented Oct 9, 2019

lintool commented Feb 11, 2020