CINXE.COM

Downloading 50,000 Podcast Feeds to Analyze Their RSS

<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/1999/xhtml" lang="en" xml:lang="en"> <head> <!-- Google tag (gtag.js) --> <script async src="https://www.googletagmanager.com/gtag/js?id=G-J80V1HPMKT"></script> <script> window.dataLayer = window.dataLayer || []; function gtag(){dataLayer.push(arguments);} gtag('js', new Date()); gtag('config', 'G-J80V1HPMKT'); </script> <title>Downloading 50,000 Podcast Feeds to Analyze Their RSS</title> <meta name="viewport" content="width=device-width, initial-scale=1"> <link rel="alternate" type="application/rss+xml" title="RSS Advisory Board" href="http://feeds.rssboard.org/rssboard" /> <link rel="preconnect" href="https://fonts.googleapis.com"> <link rel="preconnect" href="https://fonts.gstatic.com" crossorigin> <link href="https://fonts.googleapis.com/css2?family=Anton&family=Lisu+Bosa:wght@700&display=swap" rel="stylesheet"> <link rel="stylesheet" href="https://stackpath.bootstrapcdn.com/bootstrap/4.4.1/css/bootstrap.min.css" integrity="sha384-Vkoo8x4CGsO3+Hhxv8T/Q5PaXtkKtu6ug5TOeNV6gBiFeWPGFN9MuhOf23Q9Ifjh" crossorigin="anonymous"> <style type="text/css" media="screen"> @import url("https://www.rssboard.org/images/stylesheet.css"); </style> <meta name="robots" content="noodp" /><meta name="description" content=" The software developer Niko Abeler has crawled 51,165 podcast feeds to study what RSS elements they contain. His" /><link rel="canonical" href="https://www.rssboard.org/news/219/downloading-50000-podcast-feeds-analyze" /> </head> <body> <!-- ukey="13C1F4FD" --> <!-- Begin Page --> <div class="container page"> <div class="row"> <div class="col col-md-12 banner"> <h1 class="bannerhead"><a href="/">RSS Advisory Board</a></h1> <h2 class="bannersubhead">Really Simple Syndication specifications, tutorials and discussion</h2> </div> </div> <div class="row"> <div class="col col-md-10"><!-- Begin Google AdSense --> <div align="center"> <p><script type="text/javascript"><!-- google_ad_client = "pub-8378161688790357"; /* RSS Board, 728x90, created 10/30/10 */ google_ad_slot = "8997688149"; google_ad_width = 728; google_ad_height = 90; //--> </script> <script type="text/javascript" src="//pagead2.googlesyndication.com/pagead/show_ads.js"> </script></p> </div> <!-- End Google AdSense --><!-- Begin Main Content --> <!-- Begin Entries --> <div class="entries"> <a name="219"></a><div class="entryhead"><p><a href="https://www.rssboard.org/news/219/downloading-50000-podcast-feeds-analyze">Downloading 50,000 Podcast Feeds to Analyze Their RSS</a></p></div><div class="entry"><figure class="text-center"><img src="https://www.rssboard.org/images/apple-podcasts-icon.png" alt="The Apple Podcasts icon, an i enclosed by two partial circles" class="img-fluid" width="250" /></a></figure> <p>The software developer Niko Abeler has crawled 51,165 podcast feeds to study what RSS elements they contain. His comprehensive <a href="https://podcast-standard.org/">Podcast Feed Standard report</a> looks at the usage of <a href="https://www.rssboard.org/rss-specification">core RSS elements</a> and namespace elements from Apple iTunes, Atom, Content, Podcast 2.0 and Simple Chapters. He writes:</p> <blockquote> <p>In the world of podcasting, there is a great deal of freedom when it comes to the format and content of a podcast. Creators are free to choose their own audio format and feed content, giving them the flexibility to create something truly unique. However, when it comes to distributing a podcast, certain standards must be followed in order to be added to an aggregator such as Apple Podcasts. Additionally, the podcasting community has come to agree upon certain conventions that can be used to add additional features to a podcast, such as chapters, enhanced audio, and more. These conventions allow for a more immersive and engaging listening experience for the audience.</p> <p>This website is dedicated to providing guidance and information on the conventions and standards used in podcasting.</p> </blockquote> <p>There's a lot of interesting data in the <a href="https://podcast-standard.org/rss_tags/">RSS 2.0 report</a>, which finds that these are the six least popular elements in an RSS feed's channel:</p> <p><table class="table"> <thead> <tr> <td><b>Element</b></td> <td><b>Usage</b></td> </tr> </thead> <tbody> <tr> <td><a href="https://www.rssboard.org/rss-profile#element-channel-docs">docs</a></td> <td>8.3%</td> </tr> <tr> <td><a href="https://www.rssboard.org/rss-profile#element-channel-cloud">cloud</a></td> <td>0.0%</td> </tr> <tr> <td><a href="https://www.rssboard.org/rss-profile#element-channel-rating">rating</a></td> <td>0.0%</td> </tr> <tr> <td><a href="https://www.rssboard.org/rss-profile#element-channel-skipdays">skipDays</a></td> <td>0.0%</td> </tr> <tr> <td><a href="https://www.rssboard.org/rss-profile#element-channel-skiphours">skipHours</a></td> <td>0.0%</td> </tr> <tr> <td><a href="https://www.rssboard.org/rss-profile#element-channel-textinput">textInput</a></td> <td>0.0%</td> </tr> </tbody> </table></p> <p>Over 99 percent of feeds contain the optional channel element <code>language</code> and the optional item elements <code>enclosure</code>, <code>guid</code>, <code>pubDate</code> and <code>title</code>. Only 0.2% of feeds contain a <code>source</code> element in an item.</p> <p>The iTunes namespace report shows a lot of variation in support. The required element <code>itunes:explicit</code> is only present in 18 percent of feeds and four optional elements have less than 20 percent: <code>itunes:new-feed-url</code>, <code>itunes:block</code>, <code>itunes:complete</code> and <code>itunes:title</code>. One namespace in the report, Podcast 2.0, has been proposed by Podcastindex "to provide a solution for problems which previously have been solved by multiple competing standards" and is still under development.</p> <p>The report also analyzes the audio files enclosed in the podcast feeds to determine their format, bitrate, channel and loudness. The report finds that 95.6 percent use MP3 and 4.4 percent AAC/M4A. People who like an alternative open source format will be oggravated that its sliver of the pie graph is so small it can't be seen.</p> <p>If Abeler isn't tired of crunching numbers, one thing that would be useful for the RSS Advisory Board to learn is how many of the feeds contain more than one <code>enclosure</code> element within a single item.</p> <a name="more"></a><div class="entryfooter">Posted by Rogers Cadenhead at 2023/07/14 10:38 AM | 9 COMMENTS | <a href="https://www.rssboard.org/news/219/downloading-50000-podcast-feeds-analyze">permalink</a></div></div> <div class="dateheader">Comments</div><a name="discuss"></a> <div class="entry"><a name="94032"></a><p>Hey,<br /> <p>thanks for coverage of my work. I'm glad that this is helpful to you :)<br /> <p>I did a bit more number crunching and checked the use of enclosure elements. I don't yet have a good place to display this on the website, so I'll just leave it here as a comment for the time being.<br /> <p>Based on 8.204.321 analyzed episodes (with at least one enclosure) the vast majority only uses one enclosure: 8.200.331 or 99.95%.<br /> <p>2 enclosures: 2512 or 0.03%<br /> <p>3 enclosures: 326 <br /> <p>4 enclosures: 406 <br /> <p>5 enclosures: 204 <br /> <p>6 enclosures: 120 <br /> <p>The episode (actually 6 episodes) with the most enclosures in my dataset has 44!<br /> <p>Kind Regards<br /> <p>Niko<br /> <p> </p><div class="entryfooter">Posted by <a href="https://podcast-standard.org" rel="nofollow">Niko</a> at <a href="#94032">2023-07-21 02:29 PM</a></div></div><div class="entry"><a name="94033"></a><p>Interesting data! I was not expecting so few feeds including more than one enclosure, given that WordPress allows blogs to include multiple enclosures per post and it is used to create millions of RSS feeds.</p><div class="entryfooter">Posted by <a href="https://workbench.cadenhead.org/" rel="nofollow">Rogers Cadenhead</a> at <a href="#94033">2023-07-22 07:16 PM</a></div></div><div class="entry"><a name="94157"></a><p> <a href="https://qudratullah.com.pk/product-category/attar/">Best Attar for Men</a> - Explore the best attar for men at QudratUllah. Our collection includes bold and sophisticated scents that reflect masculine elegance. Discover fragrances that leave a lasting impression and complement your style perfectly.</p><div class="entryfooter">Posted by <a href="https://qudratullah.com.pk/" rel="nofollow">Qudratullah</a> at <a href="#94157">2024-10-15 05:45 AM</a></div></div><div class="entry"><a name="94292"></a><p> <a href="https://qudratullah.com.pk/product-category/qaida/">Noorani Qaida Online</a> - Learn the Noorani Qaida online with QudratUllah. Our accessible resources and materials make it convenient for you to practice and improve your Quranic reading skills from the comfort of your home. Start your journey today!</p><div class="entryfooter">Posted by <a href="https://qudratullah.com.pk/" rel="nofollow">Qudratullah</a> at <a href="#94292">2024-11-11 10:11 AM</a></div></div><div class="entry"><a name="94339"></a><p>About seven eight IDs and the military science is going did what you worry you awake</p><div class="entryfooter">Posted by <a href="About seven eight IDs and the military science is going did what you worry you awake" rel="nofollow">About seven eight IDs and the military science is going did what you worry you awake</a> at <a href="#94339">2024-11-25 01:05 PM</a></div></div><div class="entry"><a name="94348"></a><p>Janeiro this building were discussing death So some stage</p><div class="entryfooter">Posted by <a href="Janeiro this building were discussing death So some stage" rel="nofollow">Janeiro this building were discussing death So some stage</a> at <a href="#94348">2024-11-26 05:02 PM</a></div></div><div class="entry"><a name="94382"></a><p><a href="https://dinosaurgames.io">dinosaur game</a> is an entertaining adventure game that is played in an arcade-style format. Players are transported to the Jurassic Period.</p><div class="entryfooter">Posted by Nina Foreman at <a href="#94382">2024-12-02 11:24 PM</a></div></div><div class="entry"><a name="94383"></a><p><a href="https://dinosaurgames.io">dinosaur game</a> is an entertaining adventure game that is played in an arcade-style format. Players are transported to the Jurassic Period.</p><div class="entryfooter">Posted by Nina Foreman at <a href="#94383">2024-12-02 11:25 PM</a></div></div><div class="entry"><a name="94384"></a><p>Players are whisked away to the Jurassic Period in <a href="https://dinosaurgames.io">dinosaur game</a>, an exciting arcade-style adventure.</p><div class="entryfooter">Posted by Kristy O'Neil at <a href="#94384">2024-12-02 11:25 PM</a></div></div> <a name="reply"></a><form method="post" action="https://www.rssboard.org/news/219/downloading-50000-podcast-feeds-analyze#respond"> <table width="100%" cellspacing="0" cellpadding="2"> <tr valign="top"><td><label for="author">Name</label>: </td><td><input type="text" size="50" name="author" id="author" value="" tabindex="1" /></td></tr> <tr valign="top"><td><label for="link">Web Site</label>: </td><td><input type="text" size="50" name="link" id="link" value="" tabindex="2" /></td></tr> <tr valign="top"><td colspan="2"><label for="feedback_11">Comment</label>: </td></tr> <tr valign="top"><td colspan="2"><p><textarea name="feedback" id="feedback" cols="70" rows="18" tabindex="3"></textarea> <p><div class="entryhead" id="comment_preview_head"><p id="comment_preview"></p></div></td></tr> <tr valign="top"><td></td><td><input type="hidden" name="dex" value="" /><!-- <input type="text" name="abuseflag" value="8-222-208-146" size="30" /> --><input type="hidden" name="dkey" value="338" size="30" /><input type="submit" name="submit" value="Preview Comment" tabindex="4" /><input type="hidden" name="dkey" value="338" SIZE="30"></td></tr> </table> </form> <!-- End Entries --> </div><!-- End Main Content --> </div> <div class="col col-md-2 sidebar"> <!-- Begin Sidebar --> <p class="text-center"> <a href="/"><img src="/images/rss-feed-icon-96-by-96.png" width="96" alt="RSS icon" class="img-fluid"></a> </p> <div class="sidebargroup"> <div class="sidebartitle">Main Menu</div> <div class="sidebarlink"><a href="https://www.rssboard.org/">RSS Advisory Board</a></div> <div class="sidebarlink"><a href="https://www.rssboard.org/rss-specification">RSS Specification</a></div> <div class="sidebarlink"><a href="https://www.rssboard.org/rss-profile">RSS Profile</a></div> <div class="sidebarlink"><a href="https://www.rssboard.org/rss-validator/">RSS Validator</a></div> <div class="sidebarlink"><a href="https://www.rssboard.org/rss-autodiscovery">RSS Autodiscovery</a></div> <div class="sidebarlink"><a href="https://www.rssboard.org/advisory-board">Board Members</a></div> <div class="sidebarlink"><a href="https://www.rssboard.org/charter" rel="nofollow">Charter</a></div> <div class="sidebarlink"><a href="https://groups.google.com/g/rss-public">RSS-Public Mailing List</a></div> <div class="sidebarlink"><a href="https://www.rssboard.org/rss-language-codes">RSS Language Codes</a></div> <div class="sidebarlink"><a href="https://www.rssboard.org/rsscloud-interface">RSSCloud Interface</a></div> <div class="sidebarlink"><a href="https://www.rssboard.org/rss-history">RSS History</a></div> </div> <div class="sidebargroup"> <div class="sidebartitle">RSS Feeds</div> <div class="sidebarrsslink"><a href="http://feeds.rssboard.org/rssboard" rel=\"nofollow\"><img src="/images/rss-icon.png" alt="Subscribe to Really Simple Syndication" border="0" width="16" height="16" /></a> <a href="http://feeds.rssboard.org/rssboard" rel=\"nofollow\">This Weblog</a></div> </div> <div class="sidebargroup"> <div class="sidebartitle">Social Media</div> <div class="sidebarlink"><a rel="me" href="https://mastodon.social/@rssboard">Mastodon</a></div> </div> <div class="sidebargroup"> <div class="sidebartitle">Spec Library</div> <div class="sidebarlink"><a href="https://www.rssboard.org/rss-specification">RSS 2.0 (Current)</a></div><div class="sidebarlink"><a href="https://www.rssboard.org/rss-2-0">RSS 2.0</a></div><div class="sidebarlink"><a href="https://www.rssboard.org/rss-0-9-2">RSS 0.92</a></div><div class="sidebarlink"><a href="https://www.rssboard.org/rss-0-9-1">RSS 0.91</a> (UserLand)</div><div class="sidebarlink"><a href="https://www.rssboard.org/rss-0-9-1-netscape">RSS 0.91</a> (Netscape)</div><div class="sidebarlink"><a href="https://www.rssboard.org/rss-0-9-0">RSS 0.90</a></div> <div class="sidebarlink"><a href="https://www.rssboard.org/rss-change-notes">RSS Change Notes</a></div> </div> <div class="sidebargroup"> <div class="sidebartitle">Namespaces</div> <div class="sidebarlink"><a href="https://www.rssboard.org/creative-commons">Creative Commons</a></div> <div class="sidebarlink"><a href="https://www.rssboard.org/media-rss">Media RSS</a></div> <div class="sidebarlink"><a href="https://www.rssboard.org/trackback">Trackback</a></div> </div> <div class="sidebargroup"> <div class="sidebartitle">Archive</div> <div class="sidebarlink"><a href="https://www.rssboard.org/site-index" rel="nofollow">Site Index</a></div> </div> <div class="sidebargroup"> <div class="sidebartitle">Board Members</div> <div class="sidebarlink">Rogers Cadenhead</div> <div class="sidebarlink">Sterling "Chip" Camden</div> <div class="sidebarlink">Simone Carletti</div> <div class="sidebarlink">James Holderness</div> <div class="sidebarlink">Jenny Levine</div> <div class="sidebarlink">Eric Lunt</div> <div class="sidebarlink">Randy Charles Morin</div> <div class="sidebarlink">Ryan Parman</div> <div class="sidebarlink">Jake Savin</div> <div class="sidebarlink">Jason Shellen</div> <div class="sidebarlink">Paul Querna</div> </div> <div class="sidebargroup"> <div class="sidebartitle">Other Formats</div> <div class="sidebarlink"><a href="https://datatracker.ietf.org/doc/html/rfc4287" rel="nofollow">Atom</a></div> <div class="sidebarlink"><a href="http://web.resource.org/rss/1.0/" rel="nofollow">RDF Site Summary</a></div> </div> <!-- End Sidebar --> </div> </div> <div class="row top10header"> <div class="col"> <h2 class="top10">Popular Pages on This Site</h2> </div> </div> <div class="row"> <div class="col"> <ul> <li><a href="https://www.rssboard.org/rss-validator/">RSS Validator</a></li> <li><a href="https://www.rssboard.org/media-rss">Media RSS Specification (Current)</a></li> <li><a href="https://www.rssboard.org/rss-specification">RSS 2.0 Specification</a></li> <li><a href="https://www.rssboard.org/rss-profile">RSS Best Practices Profile</a></li> <li><a href="https://www.rssboard.org/redirect-rss-feed">How to Redirect an RSS Feed</a></li> </ul> </div> <div class="col"> <ul> <li><a href="https://www.rssboard.org/rss-language-codes">RSS Language Codes</a></li> <li><a href="https://www.rssboard.org/rss-enclosures-use-case">Payloads for RSS</a></li> <li><a href="https://www.rssboard.org/rss-encoding-examples">RSS Encoding Examples</a></li> <li><a href="https://www.rssboard.org/rss-autodiscovery">RSS Autodiscovery</a></li> <li><a href="https://www.rssboard.org/news/212/read-rss-feed-php-using">How to Read an RSS Feed with PHP Using SimplePie</a></li> </ul> </div> </div></div> <!-- End Page --> <div class="footer"> <p><a href="/">RSS Advisory Board</a> | <a href="http://feeds.rssboard.org/rssboard">RSS Feed</a> | <a href="https://www.rssboard.org/rss-specification">RSS Specification</a> | Hosted by <a href="https://workbench.cadenhead.org/">Workbench</a></p> </div> </div> <script src="https://code.jquery.com/jquery-3.4.1.slim.min.js" integrity="sha384-J6qa4849blE2+poT4WnyKhv5vZF5SrPo0iEjwBvKU7imGFAV0wwj1yYfoRSJoZ+n" crossorigin="anonymous"></script> <script src="https://cdn.jsdelivr.net/npm/popper.js@1.16.0/dist/umd/popper.min.js" integrity="sha384-Q6E9RHvbIyZFJoft+2mJbHaEWldlvI9IOYy5n3zV9zzTtmI3UksdQRVvoxMfooAo" crossorigin="anonymous"></script> <script src="https://stackpath.bootstrapcdn.com/bootstrap/4.4.1/js/bootstrap.min.js" integrity="sha384-wfSDF2E50Y2D1uUdj0O3uMBJnjuUD4Ih7YwaYd1iqfktj0Uod8GCExl3Og8ifwB6" crossorigin="anonymous"></script> </body> </html>

Pages: 1 2 3 4 5 6 7 8 9 10