-
-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
"invalid regex" error during manual OSM import #279
Comments
Does this mean that the http URLs are banned ? |
These warnings are generated when OSM data contains a URL in the name field.
Records containing URLs in the name field are skipped, these are considered 'bad data' as we don't want URLs to end up in the search engine. This feature was introduced in: pelias/model#115 These warnings are unfortunately common since OSM contains many data errors, you can ignore them: curl -Ls https://data.geocode.earth/osm/2022-35/france-valhalla.polylines.0sv.gz | pigz -d | grep -a 'http://' | cut -d '' -f2- --output-delimiter=$'\t'
Sofinel Runs Path http://www.sofinelruns.com
Sentier des Zaubis Maire Bouxières http://www.mairie-bouxieres-aux-dames.fr/wp-content/uploads/2005/01/Les-sentiers-de-Bouxi%C3%A8res-aux-Dames.pdf
Sentier des Chasupes Mairie Bouxières http://www.mairie-bouxieres-aux-dames.fr/wp-content/uploads/2005/01/Les-sentiers-de-Bouxi%C3%A8res-aux-Dames.pdf
Sentier des Quarterons http://www.mairie-bouxieres-aux-dames.fr/wp-content/uploads/2005/01/Les-sentiers-de-Bouxi%C3%A8res-aux-Dames.pdf |
I noticed that we improved detection of these streets in d4c5305 That was 5+ years ago, are you possibly running some ancient docker containers or something? |
This functionality was improved today in pelias/model#160, which added infix removal of URLs within |
Describe the bug
Hi, i am trying to import the following file proposed for France into polylines module of Pelias : https://data.geocode.earth/osm/2022-35/france-valhalla.polylines.0sv.gz
And a couple of errors appeared while i am not sure if the root cause comes from the data itself or the importer code.
Steps to Reproduce
The file have been previously gunzipped.
Running this command :
pelias import polylines
It brings then this kind of error:
[polyline] polyline document error message=invalid regex test, Maire Bouxières http://www.mairie-bouxieres-aux-dames.fr/wp-content/uploads/2005/01/Les-sentiers-de-Bouxi%C3%A8res-aux-Dames.pdf should not match /https?:\/\//, stack=PeliasModelError: invalid regex test, Maire Bouxières http://www.mairie-bouxieres-aux-dames.fr/wp-content/uploads/2005/01/Les-sentiers-de-Bouxi%C3%A8res-aux-Dames.pdf should not match /https?:\/\//
The job seems to continue anyway until its end.
Complete logs
Environment (please complete the following information):
The concerned environment is the docker stack provided into https://github.com/pelias/docker , running on a Debian 6.1 machine.
Thanks in advance
The text was updated successfully, but these errors were encountered: