Hex badges show up as "invalid" #1285

eproxus · 2017-11-27T11:52:32Z

See examples on http://shields.io or in https://github.com/eproxus/meck/blob/master/README.md

paulmelnikow · 2017-11-27T22:47:00Z

Confirmed, these are all showing invalid.

https://img.shields.io/hexpm/dw/plug.svg

https://img.shields.io/hexpm/v/plug.svg

https://img.shields.io/hexpm/v/meck.svg

https://img.shields.io/hexpm/l/meck.svg

Probably an API change. Would someone like to look into it? Here are the tests.

PyvesB · 2017-11-28T09:52:40Z

I'll have a look into this issue sometime this week.

platan · 2017-11-28T16:02:51Z

I ran Shields locally and all Hex.pm badges work correctly. The service tests pass as well.

paulmelnikow · 2017-11-28T16:36:35Z

Could you try the deployed commit too? It's possible, if unlikely, that it's been fixed since the deploy.

platan · 2017-11-28T17:00:09Z

It works with master (4b5bf03) and gh-pages (2fd5949).

PyvesB · 2017-11-28T20:23:44Z

I checked the API response, and it seems to be consistent with the processing done in the code. Tests are working fine as well.

I also fired a server up with the currently deployed commit on my local machine (Node 8.9.1). Hex badges are generated as expected:

Therefore I'm unsure why we are getting such errors on these badges. After a quick look into the hexpm repository, throttling/address blocking seems to be implemented on their side. Could we be hitting the rate limitations and trying to parse bogus responses, leading to "invalid" badges? Trying to make an API request directly from the production server may help us out here.
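
One way to test this hypothesis is to tell a throttled upstream response apart from a malformed one before the badge is rendered. The sketch below is illustrative only: `classify_upstream` is a hypothetical helper, not Shields code, and it assumes a blocked client receives a non-200 status such as 429, which this thread does not confirm for Hex.pm.

```python
import json


def classify_upstream(status: int, body: str) -> str:
    """Map an upstream API response to a badge message (hypothetical helper)."""
    if status == 429:
        # Assumption: the upstream signals throttling with HTTP 429.
        return "rate limited"
    if status != 200:
        return "inaccessible"
    try:
        data = json.loads(body)
    except json.JSONDecodeError:
        # The body was not JSON at all, e.g. an HTML error page.
        return "invalid"
    if not isinstance(data, dict):
        return "invalid"
    return "ok"
```

With a distinction like this, a blocked server would show "rate limited" rather than the ambiguous "invalid".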

ericmj · 2017-11-28T21:26:50Z

If you provide a list of IPs that your production servers use, I can check if they have hit the rate limiting on Hex.pm.

paulmelnikow · 2017-11-29T15:35:02Z

Making a request to the production servers is a good idea. I don't have access yet, however.

Here are the three IPs:

$ host s0.shields-server.com
s0.shields-server.com is an alias for vps71670.vps.ovh.ca.
vps71670.vps.ovh.ca has address 192.99.59.72
$ host s1.shields-server.com
s1.shields-server.com is an alias for vps244529.ovh.net.
vps244529.ovh.net has address 51.254.114.150
$ host s2.shields-server.com
s2.shields-server.com is an alias for vps117870.vps.ovh.ca.
vps117870.vps.ovh.ca has address 149.56.96.133

ericmj · 2017-11-29T15:48:49Z

All of these IP addresses have been blocked because they consistently exceeded 100 requests/min to the Hex.pm API. We can unblock them but my guess is that they will hit the rate limiting again.

I have suggested before that shields.io should do conditional HTTP requests and request collapsing. As an example, when I load http://shields.io/, 3 individual API requests are made for each Hex.pm plug package badge. Why are these requests not collapsed into a single request, and why are the Cache-Control, ETag, and Last-Modified headers ignored?
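
As a rough illustration of request collapsing, the sketch below lets concurrent callers for the same URL share one in-flight fetch. It is not how Shields is implemented (Shields is a Node.js project); the `Collapser` class and `fake_fetch` function are hypothetical, and the URL follows the `https://hex.pm/api/packages/<name>` pattern that is only assumed later in this thread.

```python
import asyncio


class Collapser:
    """Share one in-flight upstream fetch among concurrent callers per URL."""

    def __init__(self, fetch):
        self._fetch = fetch      # async callable: url -> response
        self._in_flight = {}     # url -> asyncio.Task currently running

    async def get(self, url):
        task = self._in_flight.get(url)
        if task is None:
            # First caller starts the fetch; later callers await the same task.
            task = asyncio.create_task(self._fetch(url))
            self._in_flight[url] = task
            task.add_done_callback(lambda _t: self._in_flight.pop(url, None))
        return await task


calls = []


async def fake_fetch(url):
    """Stand-in for a real HTTP request; records each upstream call."""
    calls.append(url)
    await asyncio.sleep(0.01)
    return {"url": url}


async def demo():
    collapser = Collapser(fake_fetch)
    url = "https://hex.pm/api/packages/plug"
    # Three badges asking for the same package data at the same time.
    return await asyncio.gather(
        collapser.get(url), collapser.get(url), collapser.get(url)
    )


results = asyncio.run(demo())
```

Here three simultaneous badge requests produce a single upstream call. Collapsing only covers overlapping requests; reusing a response over time needs a cache or conditional requests on top.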

This is ignoring the hundreds of other badges that are loaded from different services; refreshing http://shields.io/ is a great way to have your own little DoS service.

If shields will not improve its caching I guess we have to build a special endpoint that is cheaper and only returns the data you need and that we don't have to rate limit. If you let us know what endpoints you hit on the Hex.pm API and the fields you need we can build this endpoint for you.

paulmelnikow · 2017-11-29T18:51:22Z

I’m happy to discuss solutions. I joined the project several months ago so I wasn't part of the previous discussion. The Shields servers serve about 10k requests per minute, and while I don’t have per-service stats, I’m not surprised that the servers could at times make more than 100 req/min to Hex.pm.

The caching in Shields is based on the request. That means subsequent requests for the same badge will be cached for a while, though requests for different badges (e.g. license vs. version vs. downloads) will not. So the home page probably is not the problem. Once those badges have been generated, they will not trigger new requests until they are invalidated.

It would be nice to add caching for the service requests! Since a lot of projects will display multiple badges which pull the same data, we could save a sizable number of requests this way. I could see implementing it as part of the service rewrite.
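
A minimal sketch of what caching at the service-request level could look like: the cache is keyed by the upstream URL rather than by the badge URL, so several badges for one package share one fetch. The class name, the TTL value, and the sample payload are all assumptions made for illustration.

```python
import time


class UpstreamCache:
    """Cache upstream API responses by URL with a time-to-live."""

    def __init__(self, fetch, ttl_seconds=300.0, clock=time.monotonic):
        self._fetch = fetch
        self._ttl = ttl_seconds
        self._clock = clock
        self._entries = {}  # url -> (expires_at, data)

    def get(self, url):
        now = self._clock()
        entry = self._entries.get(url)
        if entry is not None and entry[0] > now:
            return entry[1]           # still fresh: no upstream request
        data = self._fetch(url)
        self._entries[url] = (now + self._ttl, data)
        return data


fetch_count = 0


def fake_hex_api(url):
    """Stand-in for the Hex.pm API; the values below are made up."""
    global fetch_count
    fetch_count += 1
    return {
        "downloads": {"week": 10},
        "releases": [{"version": "1.0.0"}],
        "meta": {"licenses": ["MIT"]},
    }


cache = UpstreamCache(fake_hex_api)
url = "https://hex.pm/api/packages/meck"

# Three different badges for the same package reuse one upstream response.
version = cache.get(url)["releases"][0]["version"]
licenses = cache.get(url)["meta"]["licenses"]
weekly = cache.get(url)["downloads"]["week"]
```

A real version would also need a size limit, for the memory reasons discussed in the next paragraph.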

Again I don't have exact numbers, but my impression is that Shields gets by on a tiny hosting budget, relying on optimized code, and avoiding any compute-intensive work. See this conversation on Twitter. The in-memory cache is size-limited to avoid OOM conditions, so I’m guessing to add a more sizable cache we’d need to add some hosting budget and back it with Redis or Memcache.

The data the current badges use is:

downloads.week
downloads.day
downloads.all
releases[0].version
meta.licenses
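
For reference, the fields above could be pulled out of a package response roughly as follows. The `badge_fields` helper and the sample payload are hypothetical; only the field paths come from the list above.

```python
def badge_fields(package: dict) -> dict:
    """Extract only the values the Hex.pm badges need from a package response."""
    downloads = package.get("downloads", {})
    releases = package.get("releases", [])
    return {
        "downloads_week": downloads.get("week"),
        "downloads_day": downloads.get("day"),
        "downloads_all": downloads.get("all"),
        # releases[0] is treated as the latest release, as in the list above.
        "latest_version": releases[0]["version"] if releases else None,
        "licenses": package.get("meta", {}).get("licenses", []),
    }


# Made-up payload with the same shape as the field paths listed above.
sample = {
    "downloads": {"week": 70, "day": 10, "all": 1000},
    "releases": [{"version": "1.2.3"}, {"version": "1.2.2"}],
    "meta": {"licenses": ["Apache-2.0"]},
}
fields = badge_fields(sample)
```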

You could make a batch process that dumps this to a static file, which we could grab once an hour (or once a day) and keep in the cache.

Another thought… would it be possible to add caching behind your endpoint? What makes this request so expensive?

paulmelnikow · 2017-11-29T18:52:45Z

You might also consider issuing an API key for Shields, as some other services have done.

platan · 2017-11-29T20:15:51Z

The Shields.io webpage currently displays about 320 badges. If it's visited frequently (more frequent requests from shields.io than from other pages), all badges should be in the cache.
Referer stats from requests would tell us where the traffic comes from.

Do you know how much RAM the Shields servers have? I would like to compare the in-memory caching done by the Shields code with Varnish Cache https://varnish-cache.org/intro/

ericmj · 2017-11-29T20:28:35Z

I have whitelisted the shields.io IPs so they should not be blocked in the future.

> The data the current badges use is:

Thanks for this. I will create an optimized endpoint that only returns the data that Shields uses, and I will let you know when it's live.

> Another thought… would it be possible to add caching behind your endpoint? What makes this request so expensive?

It's not super expensive, it's just that if we make a specific endpoint that ignores the rate limiting I feel like it should be optimized.

> You might also consider issuing an API key for Shields, as some other services have done.

Sounds like a good idea, this way we don't have to rely on whitelisting specific IPs.

> Shields.io webpage currently displays about 320 badges. If it's visited frequently (more frequent requests from shields.io than from other pages) all badges should be in cache.
> Referer stats from requests would give us answer about sources of traffic.

This is not what I am seeing, if I grep our access logs and reload http://shields.io I see 3 new requests every time.

platan · 2017-11-29T20:35:44Z

> This is not what I am seeing, if I grep our access logs and reload http://shields.io I see 3 new requests every time.

Thanks for this info! So they are not in the Shields cache (but we have 3 servers, and therefore 3 caches).

paulmelnikow · 2017-11-29T20:46:46Z

> This is not what I am seeing, if I grep our access logs and reload http://shields.io I see 3 new requests every time.

Is that still happening after the whitelisting?

As @platan aptly observes, we do have three servers, and they don't share a cache. The requests should trickle to zero, as each of the three accumulates the five badges in its cache.

There are five Hex.pm badges on the page, so if nothing were being cached I'd expect to see five.
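
The warm-up behaviour described above can be sketched with a small simulation: three servers with separate caches and five badges per page load, so at most 3 × 5 = 15 upstream requests before every cache is warm. It assumes strict round-robin distribution of requests across servers and no cache expiry, neither of which is confirmed in this thread.

```python
def simulate(reloads: int, servers: int = 3, badges: int = 5) -> list:
    """Count upstream requests (cache misses) caused by each page reload."""
    caches = [set() for _ in range(servers)]   # one independent cache per server
    misses_per_reload = []
    request_index = 0
    for _ in range(reloads):
        misses = 0
        for badge in range(badges):
            # Assumption: requests are spread round-robin across servers.
            cache = caches[request_index % servers]
            request_index += 1
            if badge not in cache:
                cache.add(badge)               # fetch upstream once, then cache
                misses += 1
        misses_per_reload.append(misses)
    return misses_per_reload


misses = simulate(6)
print(misses)  # [5, 5, 5, 0, 0, 0]
```

Under these assumptions the upstream requests stop after 15 in total. A steady 3 new requests on every reload would therefore point to badges not being cached at all, or to entries expiring very quickly.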

> I have whitelisted the shields.io IPs so they should not be blocked in the future.

Thank you!

eproxus · 2018-06-26T07:06:15Z

@paulmelnikow @platan (cc @ericmj) The badges are still showing up as invalid; I don't think it has really changed since last year. This affects all Erlang and Elixir projects that are using Shields.io for Hex.pm 🙁

ericmj · 2018-06-26T07:10:13Z

It was definitely fixed last year, and they work for me when I go to shields.io, although the badges load slowly. Which badges are invalid for you?

eproxus · 2018-06-26T07:18:10Z

Badges on https://github.com/eproxus/meck/blob/master/README.md

Looking at the requests, it seems requests to camo.githubusercontent.com time out:

[Error] Failed to load resource: the server responded with a status of 504 (Gateway Timeout) (68747470733a2f2f696d672e736869656c64732e696f2f686578706d2f762f6d65636b2e7376673f7374796c653d666c61742d737175617265, line 0)

Looking further, GitHub generates the following HTML for the README:

<a href="https://hex.pm/packages/meck" rel="nofollow">
    <img src="https://camo.githubusercontent.com/cc57adb0caefa2a016a54b863eded43f96dcd269/68747470733a2f2f696d672e736869656c64732e696f2f686578706d2f762f6d65636b2e7376673f7374796c653d666c61742d737175617265" 
         alt="Hex.pm Version"
         data-canonical-src="https://img.shields.io/hexpm/v/meck.svg?style=flat-square"
         style="max-width:100%;">
</a>

Accessing https://img.shields.io/hexpm/v/meck.svg?style=flat-square actually works, but I think the issue is that it is too slow (13.69 s), so GitHub's intermediate caching layer probably times out. A request to https://hex.pm/api/packages/meck is very fast, 137 ms (I assume this is the endpoint Shields.io is using).
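
The last path segment of the camo URL is the original image URL, hex-encoded, which is how the failing resource in the error above maps back to the Shields badge. A small sketch to decode it (`decode_camo_url` is a hypothetical helper name):

```python
from urllib.parse import urlsplit


def decode_camo_url(camo_url: str) -> str:
    """Return the original image URL hex-encoded in a camo proxy URL."""
    hex_part = urlsplit(camo_url).path.rsplit("/", 1)[-1]
    return bytes.fromhex(hex_part).decode("utf-8")


camo = (
    "https://camo.githubusercontent.com/"
    "cc57adb0caefa2a016a54b863eded43f96dcd269/"
    "68747470733a2f2f696d672e736869656c64732e696f2f686578706d2f762f"
    "6d65636b2e7376673f7374796c653d666c61742d737175617265"
)
original = decode_camo_url(camo)
print(original)  # https://img.shields.io/hexpm/v/meck.svg?style=flat-square
```

The decoded value matches the `data-canonical-src` attribute in the HTML above.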

Looks like the issue is the request speed to img.shields.io.

eproxus · 2018-06-26T07:23:34Z

Seems to be a case of #1568.

Closing this since it was just related to Hex.pm, which is now no longer the problem.

paulmelnikow added the service-badge and question labels on Nov 27, 2017

paulmelnikow added the operations label and removed the question label on Nov 29, 2017

RobertDober mentioned this issue Nov 30, 2017

Fix hex badge in README pragdave/earmark#167

Closed

paulmelnikow mentioned this issue Mar 19, 2018

Caching of API responses #1545

Closed

eproxus closed this as completed Jun 26, 2018

calebcartwright mentioned this issue May 1, 2020

Route Hex.pm requests through old badge server proxies #4994

Closed
