feed.xml link field parsing error #111

jc955 · 2023-09-12T03:10:19Z

link field parsing error

- Update dependencies - Fix issue #111

jc955 · 2023-09-12T03:55:33Z

@neizod
Hi!
First and foremost, thanks for your work!
Ask another question.

https://www.historyinmemes.com/feed

This feed link actually has body content, but the feed extractor only displays a thumbnail text. How can this format be displayed in its entirety? Can you help me?

ndaidong · 2023-09-12T04:11:53Z

@0x1017 you can use parserOptions parameter to customize the output.

For example, if you turn off normalization, you can get the raw result, with full description:

  const feed = await extract('https://www.historyinmemes.com/feed', {
    normalization: false,
  })
  console.log(feed)

If you still want to normalize feed data, let's use getExtraEntryFields to modify only description as below:

  const feed = await extract(url, {
    getExtraEntryFields: (feedEntry) => {
      const { description } = feedEntry
      // you can do anything with the description here
      return {
        description,
      }
    },
  })
  console.log(feed)

LavaCxx · 2024-04-07T13:53:54Z

Even though I turned off normalization, the parsing of links still changed.

LavaCxx · 2024-04-07T14:34:48Z

After some research, I found that the problem is in the fast-xml-parser. Maybe the previous xml didn't support the & character, so this problem wasn't considered?
Anyway, if anyone else encounters this problem in the future, they can refer to this code.

await extract(url, {
    normalization: false, xmlParserOptions: {
        tagValueProcessor: (tagName, tagValue) => {
            if (tagName === 'link') return tagValue.replace(/&/g, '&amp;')
            return tagValue
        }
    }
})

ndaidong added a commit that referenced this issue Sep 12, 2023

v7.0.6

3f4239c

- Update dependencies - Fix issue #111

ndaidong mentioned this issue Sep 12, 2023

v7.0.6 #112

Merged

jc955 closed this as completed Sep 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feed.xml link field parsing error #111

feed.xml link field parsing error #111

jc955 commented Sep 12, 2023

jc955 commented Sep 12, 2023

ndaidong commented Sep 12, 2023

LavaCxx commented Apr 7, 2024

LavaCxx commented Apr 7, 2024 •

edited

Loading

feed.xml link field parsing error #111

feed.xml link field parsing error #111

Comments

jc955 commented Sep 12, 2023

jc955 commented Sep 12, 2023

ndaidong commented Sep 12, 2023

LavaCxx commented Apr 7, 2024

LavaCxx commented Apr 7, 2024 • edited Loading

LavaCxx commented Apr 7, 2024 •

edited

Loading