OctoMap with CUDA or OpenCL #29

ahmedshafeeq · 2013-05-25T18:54:33Z

I think it'll be cool to have OctoMap implemented in CUDA or OpenCL to speed things up even more.

aitjellal · 2014-07-25T11:05:11Z

Hi Ahmed is there anything new about Implementing OctoMap
with CUDA/Opencl
Please let me know if you are working on this project

ahmedshafeeq · 2014-07-27T21:05:30Z

How urgently do you need this feature? If you'd like to work together, I'm willing to commit time on this.

aitjellal · 2014-07-28T06:35:46Z

Thanks for you reply.
It will be perfect if it implemented within 1 or 2 months.
I am new in programming with opencl and have no experience in programming with CUDA
so I am not sure if I will be very helpful.
But what I can do is that once a first version is released I can enhance it.

ahmedshafeeq · 2014-07-28T13:48:24Z

Alright let me try to get this out within 2 months.

aitjellal · 2014-07-28T20:50:36Z

If you can divide the task into some sub-tasks I will be glad to contribute by
making one or two tasks to speed up things.

ahmedshafeeq · 2014-07-29T15:08:24Z

@aitjellal How about looking at https://github.com/OctoMap/octomap/blob/26dc8e9ab51a51bcd0dd6297f20d11a127a7076b/octomap/include/octomap/OccupancyOcTreeBase.hxx and re-writing what you can using only the functions in http://www.cplusplus.com/reference/algorithm/

The parts that can be re-written functionally are the easiest to speed up.

ahmedshafeeq · 2014-07-29T15:08:47Z

I'll move to them to CUDA after.

aitjellal · 2014-07-29T18:49:43Z

Ok I will work on it in my free time

ahmedshafeeq · 2014-07-30T16:06:40Z

The thrust library has implemented efficient parallel versions of the algorithms in http://www.cplusplus.com/reference/algorithm/. Check thrust out here: https://github.com/thrust/thrust. In those cases that thrust has no equivalent function, I can implement them myself. The nature of the functions in the algorithm header file make it easy for one to parallelize as they tend to be very declarative in nature rather than imperative.

Squelsh · 2014-08-21T11:35:33Z

Hey guys!

I am the maintainer of the GPU-Voxels lib, which is a kind of improved Octomap for the GPU and which also handles real time collision detection.
Have a look at the website www.gpu-voxels.org.
There you can also find my 80-slides-presentation from a Workshop on the lib that we gave on the IAS conference in Padua 3 weeks ago. Our lib will go OpenSource on github within the next days / few weeks.

My intention once was to offer the same API as Octomap but we did not put that too high on our feature list, as our main intention is Collision Detection with real time 3D maps. But if you like, we could join forces and come up with a wrapper or API enhancement, so that people could easily switch and use the computational power of GPUs... (Just to get an idea: The insertion of Kinect data into a TITAN-GPU Octree is about 100 times faster than into Octomap on a high end Intel machine, including raytracing for free space calculation).

Best,
Andreas

aitjellal · 2014-08-21T12:57:32Z

Hi Andreas,
thank you for this valuable information.
I think it is a good idea to colaborate to come up with a wrapper or API enhancement for the GPU-voxels.
cheers

ahmedshafeeq · 2014-08-21T13:12:41Z

Hi Andreas,

What's stopping me at this point is access to a Linux-based GPU machine. Would you be able to provide access to one? I'd be able to put in time for this if I had access.

On 21 Aug, 2014, at 7:35 PM, "Andreas H." [email protected] wrote:

Hey guys!

I am the maintainer of the GPU-Voxels lib, which is a kind of improved Octomap for the GPU and which also handles real time collision detection.
Have a look at the website www.gpu-voxels.org.
There you can also find my 80-slides-presentation from a Workshop on the lib that we gave on the IAS conference in Padua 3 weeks ago. Our lib will go OpenSource on github within the next days / few weeks.

My intention once was to offer the same API as Octomap but we did not put that too high on our feature list, as our main intention is Collision Detection with real time 3D maps. But if you like, we could join forces and come up with a wrapper or API enhancement, so that people could easily switch and use the computational power of GPUs... (Just to get an idea: The insertion of Kinect data into a TITAN-GPU Octree is about 100 times faster than into Octomap on a high end Intel machine, including raytracing for free space calculation).

Best,
Andreas

�
Reply to this email directly or view it on GitHub.

Squelsh · 2014-08-21T13:40:05Z

I would recommend, that we wait until I get a version to github, so you could have a look at the code.
@ahmedshafeeq: Sorry, but our company policies do not allow VPN access for non company members. So no remote access.

ahornung · 2014-08-24T15:48:04Z

👍 Great to see this level of collaboration here, keep it up and let me know if there are any open questions!

yincanben · 2014-12-13T12:39:17Z

Have you implemented in CUDA or OpenCL to speed OctoMap ?

Squelsh · 2014-12-14T00:02:43Z

The GPU-Voxels Library is written in CUDA.
But up to now there is no integration between Octomap and GPU-Voxels.
A common interface is still on the wish list.

Am 13.12.2014 um 13:39 schrieb yincanben [email protected]:

Have you implemented in CUDA or OpenCL to speed OctoMap ?

—
Reply to this email directly or view it on GitHub.

Dr-LingyunXu · 2015-07-03T13:38:38Z

Is there any new progress in Octomap and CUDA combination ?

Squelsh · 2015-07-04T16:34:58Z

Nope. But if you don't have to put complex datatypes into your octree nodes, just download GPU-Voxels and have a try.

Squelsh · 2015-10-29T17:42:57Z

I just spent some time in investigating the OctoMap API and the collision check implementation of FCL that collides URDF-models with an Octree.

Long story short: The approaches are so different, that this can not be solved by intelligent wrapping / interfacing.

Longer version: FCL works with a Bounding Volume Hierarchy (a space dividing tree structure) and recursively iterates over models to lookup Voxels in the Octomap selectively. On the other side, GPU-Voxels uses an articulated pointcloud-representation as robot model and collides this with our own Octree implementation. This is done in a highly parallel manner, so we don't need a BVH.

So, when someone needs a collision check between a URDF-Robot and a live pointcloud, GPU-Voxels is magnitudes faster than OctoMap, as we can update our GPU-Octree with the data of several Kinects at full framerate AND do collision checking in parallel.
But as soon as you need a templated Octree with your own Voxel-DataTypes, use OctoMap.

Unfortunately I don't see a way to integrate this into MoveIt / FCL in a straight forward manner. So we stick with our own planners.

Regarding an OctoMap compatible API the insertion of Pointclouds is no problem, but it makes absolutely no sense to query single coordinates / voxels in GPU-Voxels in a serial fashion, as it is done with OctoMap. We can query whole pointclouds with a single call in a millisecond but serialization would nevertheless destroy performance.

ahornung · 2015-10-31T17:41:26Z

Thanks for the detailed clarification, Andreas. I'm closing this issue for now.

ahornung closed this as completed Oct 31, 2015

andre-nguyen mentioned this issue Mar 10, 2016

Cuda acceleration #112

Open

xtycoon mentioned this issue Jul 21, 2017

Convert octomap into gpu-voxel fzi-forschungszentrum-informatik/gpu-voxels#48

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OctoMap with CUDA or OpenCL #29

OctoMap with CUDA or OpenCL #29

ahmedshafeeq commented May 25, 2013

aitjellal commented Jul 25, 2014

ahmedshafeeq commented Jul 27, 2014

aitjellal commented Jul 28, 2014

ahmedshafeeq commented Jul 28, 2014

aitjellal commented Jul 28, 2014

ahmedshafeeq commented Jul 29, 2014

ahmedshafeeq commented Jul 29, 2014

aitjellal commented Jul 29, 2014

ahmedshafeeq commented Jul 30, 2014

Squelsh commented Aug 21, 2014

aitjellal commented Aug 21, 2014

ahmedshafeeq commented Aug 21, 2014

Squelsh commented Aug 21, 2014

ahornung commented Aug 24, 2014

yincanben commented Dec 13, 2014

Squelsh commented Dec 14, 2014

Dr-LingyunXu commented Jul 3, 2015

Squelsh commented Jul 4, 2015

Squelsh commented Oct 29, 2015

ahornung commented Oct 31, 2015

OctoMap with CUDA or OpenCL #29

OctoMap with CUDA or OpenCL #29

Comments

ahmedshafeeq commented May 25, 2013

aitjellal commented Jul 25, 2014

ahmedshafeeq commented Jul 27, 2014

aitjellal commented Jul 28, 2014

ahmedshafeeq commented Jul 28, 2014

aitjellal commented Jul 28, 2014

ahmedshafeeq commented Jul 29, 2014

ahmedshafeeq commented Jul 29, 2014

aitjellal commented Jul 29, 2014

ahmedshafeeq commented Jul 30, 2014

Squelsh commented Aug 21, 2014

aitjellal commented Aug 21, 2014

ahmedshafeeq commented Aug 21, 2014

Squelsh commented Aug 21, 2014

ahornung commented Aug 24, 2014

yincanben commented Dec 13, 2014

Squelsh commented Dec 14, 2014

Dr-LingyunXu commented Jul 3, 2015

Squelsh commented Jul 4, 2015

Squelsh commented Oct 29, 2015

ahornung commented Oct 31, 2015