Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix(parser): 输入的链接与平台域名不匹配时报错 #77

Merged
merged 3 commits into from
Sep 6, 2024
Merged

Conversation

thep0y
Copy link
Contributor

@thep0y thep0y commented Sep 6, 2024

避免平台不一致时无意义地解析

Summary by CodeRabbit

  • New Features

    • Enhanced input validation for the Search component, ensuring only valid numbers or HTTPS URLs are processed.
    • Added error handling for domain validation in parsing functions across multiple platforms.
    • Introduced utility functions for domain validation and room ID parsing.
  • Bug Fixes

    • Improved error messaging for invalid inputs and domain mismatches.
  • Documentation

    • Updated documentation to reflect changes in function signatures and error handling.

@thep0y thep0y added the fix fix some bugs label Sep 6, 2024
Copy link

coderabbitai bot commented Sep 6, 2024

Warning

Rate limit exceeded

@thep0y has exceeded the limit for the number of commits or files that can be reviewed per hour. Please wait 21 minutes and 5 seconds before requesting another review.

How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

Commits

Files that changed from the base of the PR and between fa3a4a1 and 7cc8cbf.

Walkthrough

The changes across several files enhance input validation and error handling for various parsers, particularly for URLs and room IDs. New utility functions and constants have been introduced to streamline these processes, while existing functions have been modified to accept additional parameters for better validation. The overall structure has been improved for robustness and clarity in handling user inputs.

Changes

File Path Change Summary
src/components/search/index.tsx Enhanced input handling and validation in the Search component; added onInput function and improved validation logic.
src/components/settings/index.tsx Minor formatting change in the onClose prop for LazyDialog.
src/parser/bilibili/index.ts Modified createBilibiliParser to allow undefined return type; updated internal logic for room ID and URL assignment.
src/parser/douyin/index.ts Integrated parseRoomID logic into createDouyinParser; improved error handling during parsing.
src/parser/douyu/index.ts Changed DouyuParser constructor to accept only number; refactored parseRoomID for better input handling.
src/parser/huya/index.ts Removed standalone parseRoomID function; updated createHuyaParser to use a new utility function for parsing.
src/parser/index.ts Enhanced error handling in parse function; added domain extraction logic using getSecondLevelDomain.
src/parser/utils.ts Introduced utility functions for domain validation and room ID parsing; added error constants for clarity.
src/parser/validators.ts Added isValidNumberOrHttpsUrl function for validating input as either a number or HTTPS URL.

Sequence Diagram(s)

sequenceDiagram
    participant User
    participant SearchComponent
    participant Validator
    participant Parser

    User->>SearchComponent: Input data
    SearchComponent->>Validator: Validate input
    Validator-->>SearchComponent: Return validation result
    SearchComponent->>Parser: Parse validated input
    Parser-->>SearchComponent: Return parsed result
    SearchComponent-->>User: Display result
Loading

🐰 Hopping through the code with glee,
New validations are here, oh what a spree!
URLs and numbers, all checked with care,
Errors caught swiftly, no worries to bear.
With each little change, our work's a delight,
Let's celebrate coding, from morning till night! 🎉


Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

Share
Tips

Chat

There are 3 ways to chat with CodeRabbit:

  • Review comments: Directly reply to a review comment made by CodeRabbit. Example:
    • I pushed a fix in commit <commit_id>.
    • Generate unit testing code for this file.
    • Open a follow-up GitHub issue for this discussion.
  • Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
    • @coderabbitai generate unit testing code for this file.
    • @coderabbitai modularize this function.
  • PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
    • @coderabbitai generate interesting stats about this repository and render them as a table.
    • @coderabbitai show all the console.log statements in this repository.
    • @coderabbitai read src/utils.ts and generate unit testing code.
    • @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
    • @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

  • @coderabbitai pause to pause the reviews on a PR.
  • @coderabbitai resume to resume the paused reviews.
  • @coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
  • @coderabbitai full review to do a full review from scratch and review all the files again.
  • @coderabbitai summary to regenerate the summary of the PR.
  • @coderabbitai resolve resolve all the CodeRabbit review comments.
  • @coderabbitai configuration to show the current CodeRabbit configuration for the repository.
  • @coderabbitai help to get help.

Other keywords and placeholders

  • Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
  • Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
  • Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (.coderabbit.yaml)

  • You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
  • Please see the configuration documentation for more information.
  • If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

  • Visit our Documentation for detailed information on how to use CodeRabbit.
  • Join our Discord Community to get help, request features, and share feedback.
  • Follow us on X/Twitter for updates and announcements.

Copy link

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 4

Outside diff range, codebase verification and nitpick comments (3)
src/parser/validators.ts (1)

1-23: Well-implemented function with clear validation logic.

The function isValidNumberOrHttpsUrl is well-implemented with clear logic for validating numbers and HTTPS URLs. The use of a comprehensive regular expression to enforce HTTPS is a good security practice.

Suggestion for Improvement:
Consider adding more comments to explain the regular expression parts for better maintainability and readability.

src/parser/utils.ts (1)

19-37: Well-structured function with robust error handling.

The function parseRoomID is well-structured and integrates domain validation with robust error handling effectively. Using predefined constants for error messages enhances consistency.

Suggestion for Improvement:
Consider adding more detailed error messages to differentiate between different types of input errors, which could improve debugging and user feedback.

src/parser/index.ts (1)

Line range hint 7-87: Significant improvements in parsing logic and error handling.

The modifications to the parse function enhance its robustness and clarity, particularly with the integration of getSecondLevelDomain for domain validation and the improved error handling mechanisms.

Suggestions for Further Improvement:

  • Consider adding more specific error types or using an error handling library to manage different error states more granely.
  • Ensure that all possible error messages are well-documented and that they are consistent across different platforms.
Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

Commits

Files that changed from the base of the PR and between 94797b2 and 554bb92.

Files selected for processing (8)
  • src/components/search/index.tsx (5 hunks)
  • src/parser/bilibili/index.ts (2 hunks)
  • src/parser/douyin/index.ts (2 hunks)
  • src/parser/douyu/index.ts (3 hunks)
  • src/parser/huya/index.ts (2 hunks)
  • src/parser/index.ts (2 hunks)
  • src/parser/utils.ts (1 hunks)
  • src/parser/validators.ts (1 hunks)
Additional comments not posted (9)
src/parser/utils.ts (1)

1-2: Good use of constants for error messages.

Using constants for error messages enhances consistency and maintainability. Well done!

src/components/search/index.tsx (3)

14-14: Approved import for input validation.

The import of isValidNumberOrHttpsUrl is essential for the enhanced input validation logic in the component.


44-46: Approved new input handler function.

The onInput function correctly updates the input state and enhances code readability by replacing an inline function.


104-104: Approved update to button disabled condition.

The updated condition for disabling the LazyButton aligns with the new validation logic, ensuring that the button is only enabled when there is a valid input and a selected platform.

src/parser/douyin/index.ts (1)

4-4: Approved import for utility function.

The import of parseRoomID is essential for the new functionality in the createDouyinParser function.

src/parser/huya/index.ts (2)

4-4: Verify the import of parseRoomID.

Ensure that the parseRoomID function is correctly imported from ../utils and is being used appropriately within the file. This is crucial since the function's logic has been moved to a utility module.


250-255: Review the updated createHuyaParser function.

The function now correctly takes an additional parameter secondLevelDomain and passes it to parseRoomID. This is a good practice as it enhances the modularity and reusability of the code by delegating the room ID parsing to a utility function. However, ensure that all calls to createHuyaParser throughout the codebase have been updated to pass this new parameter.

Verification successful

No calls to createHuyaParser found in the codebase.

The search did not reveal any instances of createHuyaParser being called elsewhere in the codebase. This suggests that there are no calls that need to be updated to include the new secondLevelDomain parameter. If this function is intended to be used, ensure that any future calls include the new parameter.

Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Description: Verify all function calls to `createHuyaParser` match the new signature.

# Test: Search for the function usage. Expect: Only occurrences of the new signature.
rg --type typescript -A 5 $'createHuyaParser'

Length of output: 79


Script:

#!/bin/bash
# Description: Verify all function calls to `createHuyaParser` match the new signature.

# Test: Search for the function usage. Expect: Only occurrences of the new signature.
rg --type ts -A 5 $'createHuyaParser'

Length of output: 409


Script:

#!/bin/bash
# Description: Verify all function calls to `createHuyaParser` match the new signature.

# Test: Search for the function usage across all file types.
rg -A 5 $'createHuyaParser'

Length of output: 399

src/parser/douyu/index.ts (2)

13-17: Verify the import of parseRoomID and related constants.

Ensure that the parseRoomID function along with INVALID_INPUT and WRONG_SECOND_LEVEL_DOMAIN constants are correctly imported from ../utils. These constants are crucial for handling errors related to room ID parsing.


340-354: Review the updated createDouyuParser function.

The function now correctly takes an additional parameter secondLevelDomain and passes it to parseRoomID. This is a good practice as it enhances the modularity and reusability of the code by delegating the room ID parsing to a utility function. Additionally, the function handles potential errors by checking if roomID is an instance of Error and returns it if so. This robust error handling is crucial for maintaining the integrity of the application.

Comment on lines 4 to 17
export const getSecondLevelDomain = (url: string) => {
// 移除协议部分
const domainPart = url.split("://")[1];

// 分割域名部分
const domainParts = domainPart.split(".");

// 如果域名部分少于两段,返回 null
//if (domainParts.length < 2) {
// return null;
//}

return domainParts[domainParts.length - 2];
};
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Clarify behavior and clean up commented code.

The function getSecondLevelDomain effectively extracts the second-level domain. However, consider removing the commented-out code if it is no longer needed, or clarify its purpose if it might be used in the future.

Additionally, ensure that the function handles cases where the URL does not contain a domain part, as the current implementation could throw an error.

Comment on lines +70 to +81
const value = input().trim();
const parsedInput = isValidNumberOrHttpsUrl(value);
if (parsedInput instanceof Error) {
setToast({ type: "error", message: parsedInput.message });
return;
}

setLoading(true);

await parse(
currentPlatform()!,
input(),
parsedInput,
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Approved enhanced validation in parsing function.

The onParse function now includes robust input validation, which enhances security and user experience by preventing invalid inputs from being processed. Consider adding more detailed error logging for debugging purposes.

Consider adding detailed error logging for debugging purposes:

if (parsedInput instanceof Error) {
+  console.error("Validation error:", parsedInput);
  setToast({ type: "error", message: parsedInput.message });
  return;
}
Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
const value = input().trim();
const parsedInput = isValidNumberOrHttpsUrl(value);
if (parsedInput instanceof Error) {
setToast({ type: "error", message: parsedInput.message });
return;
}
setLoading(true);
await parse(
currentPlatform()!,
input(),
parsedInput,
const value = input().trim();
const parsedInput = isValidNumberOrHttpsUrl(value);
if (parsedInput instanceof Error) {
console.error("Validation error:", parsedInput);
setToast({ type: "error", message: parsedInput.message });
return;
}
setLoading(true);
await parse(
currentPlatform()!,
parsedInput,

Comment on lines 119 to 124
secondLevelDomain: string,
) {
const roomID = parseRoomID(input, secondLevelDomain);
if (roomID instanceof Error) return roomID;

if (!Number.isNaN(parsedValue)) {
return parsedValue;
}

try {
const url = new URL(trimmedInput);
const basepath = url.pathname.slice(1);
return Number.parseInt(basepath);
} catch {
throw new Error("Invalid input: not a number or valid URL");
}
return new DouyinParser(roomID);
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Approved enhancements to createDouyinParser.

The addition of the secondLevelDomain parameter enhances the flexibility and robustness of the createDouyinParser function. Consider documenting the error handling behavior for better clarity.

Consider adding documentation for the error handling behavior:

/**
 * Creates a Douyin parser with enhanced input validation.
 * @param input The input URL or room ID.
 * @param secondLevelDomain The second-level domain to validate against.
 * @returns A new DouyinParser instance or an Error if validation fails.
 */
export default function createDouyinParser(
  input: string | number,
  secondLevelDomain: string,
): DouyinParser | Error {
  // Function implementation remains the same.
}

Comment on lines 213 to 225
secondLevelDomain: string,
) {
let roomID: number | undefined = undefined;
let url: string | undefined = undefined;

if (typeof input === "number") roomID = input;
else url = input;

if (url && getSecondLevelDomain(url) !== secondLevelDomain) {
return WRONG_SECOND_LEVEL_DOMAIN;
}

return new BilibiliParser(cookie, roomID, url);
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Approved enhancements to createBilibiliParser.

The addition of the secondLevelDomain parameter and the validation step significantly enhances the robustness of the createBilibiliParser function by ensuring that only valid URLs are processed. Consider documenting the error handling behavior for better clarity.

Consider adding documentation for the error handling behavior:

/**
 * Creates a Bilibili parser with enhanced URL validation.
 * @param input The input URL or room ID.
 * @param cookie The authentication cookie.
 * @param secondLevelDomain The second-level domain to validate against.
 * @returns A new BilibiliParser instance or WRONG_SECOND_LEVEL_DOMAIN if validation fails.
 */
export default function createBilibiliParser(
  input: string | number,
  cookie: string,
  secondLevelDomain: string,
): BilibiliParser | typeof WRONG_SECOND_LEVEL_DOMAIN {
  // Function implementation remains the same.
}

Copy link

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 2

Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

Commits

Files that changed from the base of the PR and between 554bb92 and fa3a4a1.

Files selected for processing (7)
  • src/components/settings/index.tsx (1 hunks)
  • src/parser/bilibili/index.ts (1 hunks)
  • src/parser/douyin/index.ts (2 hunks)
  • src/parser/douyu/index.ts (3 hunks)
  • src/parser/huya/index.ts (2 hunks)
  • src/parser/index.ts (3 hunks)
  • src/parser/utils.ts (1 hunks)
Files skipped from review due to trivial changes (1)
  • src/components/settings/index.tsx
Files skipped from review as they are similar to previous changes (4)
  • src/parser/bilibili/index.ts
  • src/parser/douyu/index.ts
  • src/parser/index.ts
  • src/parser/utils.ts
Additional comments not posted (2)
src/parser/douyin/index.ts (1)

Line range hint 4-121: Well-structured and robust error handling in DouyinParser.

The class DouyinParser is well-structured with clear separation of concerns in its methods. The error handling within the parse method is particularly robust, ensuring that all errors are appropriately caught and returned, which aligns with the PR's goal to enhance robustness in handling mismatched links.

src/parser/huya/index.ts (1)

Line range hint 4-255: Well-structured and robust error handling in HuyaParser.

The class HuyaParser is well-structured with clear separation of concerns in its methods. The error handling within the parse method is particularly robust, ensuring that all errors are appropriately caught and returned, which aligns with the PR's goal to enhance robustness in handling mismatched links.

Comment on lines +117 to +121
export default function createDouyinParser(input: string | number) {
const roomID = parseRoomID(input);
return new DouyinParser(roomID);
}

function parseRoomID(input: string | number): number {
if (typeof input === "number") return input;

const trimmedInput = input.trim();
const parsedValue = Number.parseInt(trimmedInput);
if (roomID instanceof Error) return roomID;

if (!Number.isNaN(parsedValue)) {
return parsedValue;
}

try {
const url = new URL(trimmedInput);
const basepath = url.pathname.slice(1);
return Number.parseInt(basepath);
} catch {
throw new Error("Invalid input: not a number or valid URL");
}
return new DouyinParser(roomID);
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Refactored createDouyinParser with integrated error handling.

The refactoring of createDouyinParser to include the logic from parseRoomID simplifies the overall structure and improves error handling by directly returning Error instances when parsing fails. This change aligns with the PR's objectives to enhance robustness.

Consider adding documentation to explain the error handling behavior and the conditions under which an Error is returned, as previously suggested by another reviewer.

Would you like me to help draft the documentation for this function?

Comment on lines +250 to +255
export default function createHuyaParser(
input: string | number,
): HuyaParser | Error {
const roomID = parseRoomID(input);
if (roomID instanceof Error) return roomID;

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Refactored createHuyaParser with improved modularity and error handling.

The refactoring of createHuyaParser to use parseRoomID from ../utils centralizes the room ID parsing logic, enhancing modularity and maintainability. The direct return of Error instances when parsing fails improves error handling, aligning with the PR's objectives.

Consider adding documentation to explain the error handling behavior and the conditions under which an Error is returned.

Would you like me to help draft the documentation for this function?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
fix fix some bugs
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant