Skip to content

v0.3.0

Compare
Choose a tag to compare
@MaxDall MaxDall released this 21 Apr 19:41
· 1245 commits to master since this release
28a74d0

๐Ÿš€ New Release: Version 0.3.0

We're excited to announce a significant overhaul of the Fundus crawling core logic in this release! We've transitioned from using asyncio to a ThreadPool-based solution, resulting in a more robust and performant system. Now, each publisher operates on its own thread, synchronized seamlessly through a queue.

Breaking changes

To provide a more streamlined experience, we've relocated every crawler-type specific parameter to its respective constructor. As a result, these parameters are no longer accessible through the crawl method:

  • delay -> Crawler
  • start, end -> CCNewsCrawler

Furthermore, since we removed asyncio, the crawl_async method is no longer available.

What's new

New publisher

Updating existing publisher

Bug fixes

QoL

  • Adds Pretty Print for PublisherCollection and PublisherSpec by @addie9800 in #399
  • Add custom filter for publisher_coverage to skip boolean values by @MaxDall in #408
  • Documentation Update: Explain Addition of New Countries by @addie9800 in #413
  • Attributes Parameter in Test Generation Script by @addie9800 in #411
  • Add body to unit tests by @MaxDall in #338
  • Adds a part about generate_tables script to the documentation by @MaxDall in #424

Maintenance

  • Update relevant actions to versions utilizing node 20 by @MaxDall in #417
  • Disable strict_query parsing for URL validation. by @MaxDall in #407

Full Changelog: v0.2.2...v0.3.0