Add SankeiShimbun #688

Merged
merged 2 commits into from
Jan 31, 2025


@MaxDall MaxDall commented Jan 21, 2025

No description provided.

@MaxDall MaxDall requested a review from addie9800 January 21, 2025 13:19
@MaxDall MaxDall changed the base branch from master to add-nikkei January 21, 2025 13:19
@MaxDall MaxDall changed the title Add 'SankeiShimbun' Add SankeiShimbun Jan 21, 2025

@addie9800 addie9800 left a comment


Thanks for also adding this publisher 👍


@attribute
def topics(self) -> List[str]:
    return generic_topic_parsing(self.precomputed.meta.get("news_keywords"))
I would not use news_keywords here, since all articles seem to have the same generic topics:
['Sankei Shimbun', 'Sankei', 'Sankei', 'Newspaper', 'News', 'Breaking news', 'Politics', 'Economy', 'Society', 'International', 'Sports', 'Entertainment'].
Instead, the section list seems more relevant: //*[@id="main-column"]/main/article/div[1]/div/div[2]/div[1]/ul, or alternatively the tag labels at the bottom of the article: //*[@id="main-column"]/main/article/div[11]/div/div[1]
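
To illustrate the suggestion, here is a minimal sketch of reading topics from a section list rather than from news_keywords. It is not the parser's actual implementation: it uses only the standard library's xml.etree.ElementTree, and SAMPLE_PAGE, topics_from_section_list, and the shortened path are hypothetical stand-ins. The real page markup is not shown here, and a real page would need a proper HTML parser and the full XPath quoted above.

```python
import xml.etree.ElementTree as ET
from typing import List

# Hypothetical, simplified stand-in for an article page; the real markup differs.
SAMPLE_PAGE = """
<html><body>
  <div id="main-column"><main><article>
    <ul>
      <li><a href="/politics/">Politics</a></li>
      <li><a href="/politics/policy/">Policy</a></li>
      <li><a href="/politics/">Politics</a></li>
    </ul>
  </article></main></div>
</body></html>
"""


def topics_from_section_list(root: ET.Element, list_path: str) -> List[str]:
    """Return the text of each <li> under the list at list_path, de-duplicated in order."""
    section_list = root.find(list_path)
    if section_list is None:
        # No section list on this page: report no topics instead of failing.
        return []
    topics: List[str] = []
    for item in section_list.iter("li"):
        text = "".join(item.itertext()).strip()
        if text and text not in topics:
            topics.append(text)
    return topics


root = ET.fromstring(SAMPLE_PAGE)
# Shortened, assumed path for the sample document above.
section_topics = topics_from_section_list(root, ".//*[@id='main-column']/main/article/ul")
print(section_topics)
```

Unlike the news_keywords list, topics gathered this way vary per article, because they come from the sections the article is actually filed under.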

@MaxDall MaxDall requested a review from addie9800 January 28, 2025 09:06
addie9800 previously approved these changes Jan 28, 2025
Base automatically changed from add-nikkei to master January 31, 2025 12:33
@MaxDall MaxDall dismissed addie9800’s stale review January 31, 2025 12:33

The base branch was changed.

@MaxDall MaxDall merged commit b037610 into master Jan 31, 2025
5 checks passed
@MaxDall MaxDall deleted the add-sankei-shimbun branch January 31, 2025 12:34
2 participants