Tech papers on Current Reading List

Software Management Plan / SE

  1. machine-actionable Software Management Plan Ontology (maSMP Ontology)
  2. Usage guidance (aka profiles) for the machine-actionable Software Management Plan Ontology
  3. https://doi.org/10.37044/osf.io/k8znb
  4. D4.4 - Guidelines for recommended metadata standard for research software within EOSC
  5. Intelligent analysis for software data: research and applications
  6. What Do We (Not) Know About Research Software Engineering?
  7. The Research Software Encyclopedia: A Community Framework to Define Research Software
  8. Rethinking Software Engineering in the Foundation Model Era: A Curated Catalogue of Challenges in the Development of Trustworthy FMware
  9. Making Biomedical Research Software FAIR: Actionable Step-by-step Guidelines with a User-support Tool
  10. Ten quick tips for building FAIR workflows
  11. Understanding Fairness in Software Engineering: Insights from Stack Exchange Sites
  12. Scicodes
  13. Nine Best Practices for Research Software Registries and Repositories: A Concise Guide
  14. Citation File Format (CFF) vs BibTeX Conversion
  15. Software Heritage
  16. https://edsbook.org/notebooks/about - also mentions RO-Crate, https://drive.google.com/file/d/1INJBUfC_YZf9qVtaZ_lMpSayaE2SXF5s/view
  17. https://www.researchobject.org/ro-crate/
  18. https://www.rohub.org/
  19. RO-Crate
  20. Dias: Dynamic Rewriting of Pandas Code
  21. In-Database Data Imputation: https://doi.org/10.1145/3639326
  22. DoppelGanger++: Towards Fast Dependency Graph Generation for Database Replay: https://doi.org/10.1145/3639322
  23. Machine Unlearning in Learned Databases: An Experimental Analysis: https://doi.org/10.1145/3639304
  24. Determining the Largest Overlap between Tables: https://doi.org/10.1145/3639303
  25. Modeling Shifting Workloads for Learned Database Systems: https://doi.org/10.1145/3639293
  26. Controllable Tabular Data Synthesis Using Diffusion Models: https://doi.org/10.1145/3639283
  27. Spruce: a Fast yet Space-saving Structure for Dynamic Graph Storage: https://doi.org/10.1145/3639282
  28. Optimizing Dataflow Systems for Scalable Interactive Visualization: https://doi.org/10.1145/3639276
  29. LIT: Lightning-fast In-memory Temporal Indexing: https://doi.org/10.1145/3639275
  30. Optimizing Nested Recursive Queries: https://doi.org/10.1145/3639271
  31. https://github.com/earthlab/earthpy/blob/main/.zenodo.json

Data Versioning

  1. DVC: Data Version Control - Git for Data & Models

Example project: https://github.com/binzzheng/DVC-PyTorch

Software Citations

  1. Journal Production Guidance for Software and Data Citations
  2. Software Citation Principles
  3. citation-file-format
  4. cffinit
  5. Citation File Format - Status and current challenges
  6. How to cite and describe software
  7. Software Citation (datacite.org)
  8. Citation File Format 2021

Tech Bloggers I Like

  1. https://third-bit.com/ideas/research/
  2. https://jzhao.xyz/
  3. https://vickiboykis.com/

General SE

  1. https://arxiv.org/abs/2310.10817
  2. Knowledge Graph
  3. https://wholetale.org/, https://github.com/whole-tale