Hi @gabriel-baptiste, terribly sorry for not noticing this in over a year. I'm sure you've moved on, but if you're still interested I can definitely make this fix (assuming you haven't already). Thanks for letting me know!
Hello and thank you for this excellent guide.
I used this tool previously and it worked well and was very useful.
Recently, when using the following step:
curl 'https://www.pw.org/literary_agents?filter0=9677&field_electronic_submissions_value=All&items_per_page=All' > literary-agents-pw-org.html
xmllint --html --xpath "//li[contains(@class, 'views-row')]" literary-agents-pw-org.html | xmllint --html --xpath '//text()' - | cut -d: -f2 | sed 's/^[\ ]+//g' > literary-agents-pw-org.txt
python literary_agent_parsing.py --input-file literary-agents-pw-org.txt | sort -u > literary-agents.txt
the terminal displays a variety of HTML parser errors and the final output file - literary-agents.txt - is empty.
I am not sure why it is no longer working and I really appreciate any help you may provide. Thank you.
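In case it helps narrow this down, here is a minimal sketch for checking whether the downloaded page still contains the list items that the xmllint step selects. It uses only the Python standard library and is not part of the guide. It assumes the agent entries are still `li` elements whose class contains `views-row`; the names `ViewsRowExtractor`, `extract_rows`, and `extract_rows_from_file` are hypothetical.

```python
from html.parser import HTMLParser


class ViewsRowExtractor(HTMLParser):
    """Collect the text of <li> elements whose class contains 'views-row'."""

    def __init__(self):
        super().__init__()
        self.depth = 0      # <li> nesting depth while inside a matching row
        self.rows = []      # one whitespace-normalized string per matching row
        self._chunks = []   # text fragments of the row currently being read

    def handle_starttag(self, tag, attrs):
        if tag != "li":
            return
        if self.depth > 0:
            # Nested <li> inside a matching row: track it so we close correctly.
            self.depth += 1
        elif "views-row" in (dict(attrs).get("class") or ""):
            self.depth = 1
            self._chunks = []

    def handle_endtag(self, tag):
        if tag == "li" and self.depth > 0:
            self.depth -= 1
            if self.depth == 0:
                # Collapse runs of whitespace, as the raw text is heavily indented.
                self.rows.append(" ".join(" ".join(self._chunks).split()))

    def handle_data(self, data):
        if self.depth > 0:
            self._chunks.append(data)


def extract_rows(html_text):
    """Return the text of every matching row in an HTML string."""
    parser = ViewsRowExtractor()
    parser.feed(html_text)
    parser.close()
    return parser.rows


def extract_rows_from_file(path):
    """Same as extract_rows, but reads the HTML from a file on disk."""
    with open(path, encoding="utf-8", errors="replace") as fh:
        return extract_rows(fh.read())
```

Calling `extract_rows_from_file("literary-agents-pw-org.html")` on the file saved by the curl step and checking the length of the result would show whether any rows match. If the list is empty, the site's markup may have changed or the request may be returning a different page, though I have not confirmed either.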