0

How A HyperLogLog And Other Probabilistic Data Structures Work

Titus Brown’s Awesome Big Data Algorithms talk from PyCon 2013 is a fascinating look at probabilistic data structures and is worth a watch if you’re interested in computer science.  These sometimes mysterious structures with names like HyperLogLog and Bloom filter let you do seemingly impossible things like count more unique items than your computer has memory to store.  This is possible by using statistics to estimate values rather than trying to store them in memory.  This lets the structures scale to immense amounts of data, like what Google might process in a day crawling the entire internet!

In addition to Titus’ talk there’s a great explanation of the HyperLogLog data structure from Doug Turnbull that’s worth checking out too.  Probabilistic data structures are a fascinating combination of computer science, mathematics, and statistics.


Join 4,000+ makers on Adafruit’s Discord channels and be part of the community! http://adafru.it/discord

Learn “How Computers Work” with Bill Gates, Ladyada and more – From Code.org !

CircuitPython in 2018 – Python on Microcontrollers is here!

Have an amazing project to share? Join the SHOW-AND-TELL every Wednesday night at 7:30pm ET on Google+ Hangouts.

Join us every Wednesday night at 8pm ET for Ask an Engineer!

Follow Adafruit on Instagram for top secret new products, behinds the scenes and more https://www.instagram.com/adafruit/


Maker Business — Pololu’s New Machines

Wearables — Mystical elements

Electronics — Disable unused channels!

Biohacking — Two Blood Meters to Start Your Biohacking Adventure

Get the only spam-free daily newsletter about wearables, running a "maker business", electronic tips and more! Subscribe at AdafruitDaily.com !



No Comments

No comments yet.

Sorry, the comment form is closed at this time.