-
Notifications
You must be signed in to change notification settings - Fork 159
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ST_GeomFromGeoJson, ST_Intersects, and versions of Spatial Framework & of Hive #77
Comments
@uamadman When you built |
Also, there is a pre-built JAR file in the maven repositories: |
Please paste in your |
I accidentally left in ST_Linestring and ST_Polygon for this test... Prior to the following tests my add jar was taken directly out of the sample.sql
the current .sql file for my tests.
The terminal output:
|
For the query with ST_GeomFromGeoJson, I see |
My initial research makes it seem that migrating to .14 is unavailable for average users on the cloudera standard 5.3.1. What are the next steps to help nail down this issue for .13.1? |
Try setting |
That is a good start! It works for the basic tests above...
I am currently testing against some larger databases and they are behaving much differently. I'm not sure what that change has done but initial observations have shifted the load of larger jobs from my map to my reduce in a major way. I'm restructuring my select statements to get a sampling of results. My current .sql looks as such.
Initial samples:
Thoughts? Cheers! |
That's great news @uamadman, glad to see it's working. With Hive 13, this is the best workaround we know of. By setting caching to false, the performance in Hive is no longer optimized - which explains the slowdown. If you are still looking for increased performance, your best bet may be to contact Cloudera directly and see if they have a patched 13 version, or can get you on 14. Let us know how it goes :) |
I'll be switching the entire cluster over to the HortonWorks in order to gain access to 14. I will report back in a week. I would like to recommend 12 and 13.1 be documented in the requirements section in the read me. I would hate for people to simply give up after trying due to a rather difficult to diagnose hive issue. Cheers! |
We've recently added a Hive Compatibility wiki that will be helpful in addressing this. |
Similar for Esri JSON as with GeoJSON: SELECT ST_Intersects(ST_GeomFromJson('{"paths":[[[2.5,2.5],[8,0]]],"spatialReference":{"wkid":4326}}'),
ST_GeomFromJson('{"paths":[[[1.5,1.5],[0,7]]],"spatialReference":{"wkid":4326}}'));
true |
Basic test is confirmed to work properly in Hortonworks 2.2 with Hive .14
|
We are investigating the |
Apparently used by Hive .12 & .13 with hive.cache.expr.evaluation
JSON constructors getDisplayString (#77)
Reference on |
Ran into hive server stability issues with Hortonworks and their 2.2 build. Moved testing to Couldera's nightly build 5.5.0-1.cdh5.5.0.p0.869 and Hive-1.1. Initial observations of Hive-1.1 prove it to be compatible... And responding much faster (Atleast 10x) than Hive - 0.14.0 or below.
|
Good to know, thanks. |
Migrating Esri/geometry-api-java#81 to spatial-framework-for-hadoop from geometry-api-java
The text was updated successfully, but these errors were encountered: