diff --git a/jsoc/gsoc/hpc.md b/jsoc/gsoc/hpc.md index 78173eb1c2..c3e28003ce 100644 --- a/jsoc/gsoc/hpc.md +++ b/jsoc/gsoc/hpc.md @@ -2,14 +2,6 @@ Julia is emerging as a serious tool for technical computing and is ideally suited for the ever-growing needs of big data analytics. This set of proposed projects addresses specific areas for improvement in analytics algorithms and distributed data management. -## Scheduling Algorithms for Dagger - -**Difficulty:** Medium (175h) - -Dagger.jl is a native Julia framework and scheduler for distributed execution of Julia code and general purpose data parallelism, using dynamic, runtime-generated task graphs which are flexible enough to describe multiple classes of parallel algorithms. This project proposes to implement different scheduling algorithms for Dagger to optimize scheduling of certain classes of distributed algorithms, such as mapreduce and merge sort, and properly utilizing heterogeneous compute resources. Contributors will be expected to find published distributed scheduling algorithms and implement them on top of the Dagger framework, benchmarking scheduling performance on a variety of micro-benchmarks and real problems. - -Mentors: [Julian Samaroo](https://github.com/jpsamaroo), [Krystian GuliƄski](https://github.com/krynju) - ## Distributed Training **Difficulty:** Hard (350h) @@ -21,11 +13,3 @@ There are projects now that host the building blocks: [DaggerFlux.jl](https://gi **Skills:** Familiarity with UCX, representing execution models as DAGs, Flux.jl, CUDA.jl and data/model parallelism in machine learning **Mentors:** [Julian Samaroo](https://github.com/jpsamaroo), and [Dhairya Gandhi](https://github.com/DhairyaLGandhi) - -## Distributed Arrays over Dagger - -**Difficulty:** Medium (175h) - -Array programming is possibly the most powerful abstraction in Julia, yet our distributed arrays support leaves much to be desired. This project's goal is to implement a new distributed array type on top of the Dagger.jl framework, which will allow this new array type to be easily distributed, multithreaded, and support GPU execution. Contributors will be expected to implement a variety of operations, such as mapreduce, sorting, slicing, and linear algebra, on top of their distributed array implementation. Final results will include extensive scaling benchmarks on a range of configurations, as well as an extensive test suite for supported operations. - -Mentors: [Julian Samaroo](https://github.com/jpsamaroo), [Evelyne Ringoot](https://github.com/evelyne-ringoot)