Presentation RFC for DataBricks conference: DataFusion and Arrow #2323

Closed
alamb opened this issue Apr 23, 2022 · 7 comments
Labels
documentation Improvements or additions to documentation help wanted Extra attention is needed

Comments

@alamb
Contributor

alamb commented Apr 23, 2022

Overview

@Dandandan and I are working on a presentation about DataFusion at a conference at the end of June 2022. In the spirit of open collaboration, we would like to solicit community feedback / crowdsource some of the content prior to doing so.

I hope / expect some of the content from this talk (like "why use datafusion") may make it back into the main https://arrow.apache.org/datafusion/ documentation

Also, part of this presentation is about how DataFusion is used by downstream projects, and we want to ensure our presentation is:

  1. Technically Accurate
  2. Allowed by the downstream project to mention them and their use of DataFusion

Since most of the projects we have listed are open source, I don't think it will be a problem, but before we present I want an explicit signoff from people involved in the projects.

As an aside, it is very cool to see what others are doing with DataFusion

Talk Details:

Conference details https://databricks.com/dataaisummit/north-america-2022/agenda

https://databricks.com/dataaisummit/north-america-2022/agenda/?sessionid=1798

Talk Description:

DATA ENGINEERING
DataFusion and Arrow: Supercharge Your Data Analytical Tool with a Rusty Query Engine
Learn how Rust, the Apache Arrow project, and the Data Fusion Query Engine are increasingly being used to accelerate the creation of modern data stacks.

Presentation: https://docs.google.com/presentation/d/1q1bPibvu64k2b7LPi7Yyb0k3gA1BiUYiUbEklqW1Ckc

And so with that, please leave us comments!

@iravid

iravid commented Apr 23, 2022

Thanks for including Coralogix on the slides :-) we'd be happy for that to stay on; would it be alright if we provide the text for the Overview and Use of Datafusion sections?

@Dandandan
Contributor

Thanks for including Coralogix on the slides :-) we'd be happy for that to stay on; would it be alright if we provide the text for the Overview and Use of Datafusion sections?

Yeah, that would be great 👏

@andygrove
Member

@Dandandan
Contributor

Dandandan commented Apr 25, 2022

@yjshen (blaze-rs) @houqp (roapi, delta-lake) @jonmmease (VegaFusion) @gangliao (Flock) @paveltiunov/@ovr (Cube.js) @andygrove (dask-sql, Ballista), @rdettai (Cloudfuse Buzz)

As I think you are (one of) the main authors of those projects, would you be able to review or contribute the content for your project, or maybe help us find someone who can?

@jonmmease
Contributor

Thanks for the ping @alamb and @Dandandan. I'd be very happy to have VegaFusion mentioned! I added a couple of minor comments to the slide linked above, but it overall looks good to me!

@alamb
Contributor Author

alamb commented Apr 26, 2022

@alamb It looks like these two presentations linked to from the main presentation are not publicly accessible:

@andygrove thank you for pointing that out -- given how the permissions are set up I can't make them publicly accessible in general, but I can share them explicitly with individuals. I am happy to do so with anyone who would like access.

They are also on slideshare https://www.slideshare.net/AndrewLamb32/presentations

https://www.slideshare.net/AndrewLamb32/a-rusty-introduction-to-apache-arrow-and-how-it-applies-to-a-time-series-database

https://www.slideshare.net/AndrewLamb32/2021-0420-apache-arrow-and-its-impact-on-the-database-industrypptx

https://www.slideshare.net/AndrewLamb32/2021-1013-i-ox-query-processing

@jychen7
Contributor

jychen7 commented May 8, 2022

nice, great presentation
