
February 19, 2015 AT 2:31 pm

How A HyperLogLog And Other Probabilistic Data Structures Work

Titus Brown’s Awesome Big Data Algorithms talk from PyCon 2013 is a fascinating look at probabilistic data structures and is worth a watch if you’re interested in computer science. These sometimes mysterious structures, with names like HyperLogLog and Bloom filter, let you do seemingly impossible things like count more unique items than your computer has memory to store. They manage this by using statistics to estimate values instead of storing every item in memory, trading a small amount of accuracy for a large saving in space. That lets the structures scale to immense amounts of data, like what Google might process in a day crawling the entire internet!
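To make the idea a little more concrete, here is a minimal sketch of a Bloom filter in Python. It is not taken from the talk; the class name, the default sizes, and the choice of SHA-256 as the hash are all assumptions made for illustration. The filter answers "have I seen this item?" using a fixed block of bits, no matter how many items are added. It can report a false positive, but it never misses an item that was actually added.

```python
import hashlib


class BloomFilter:
    """Illustrative Bloom filter: fixed memory, approximate membership."""

    def __init__(self, num_bits=1024, num_hashes=3):
        # num_bits and num_hashes are arbitrary example values (assumptions).
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, item):
        # Derive several bit positions by salting one hash function
        # with the index of each "virtual" hash.
        data = item.encode("utf-8")
        for i in range(self.num_hashes):
            digest = hashlib.sha256(bytes([i]) + data).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item):
        # Set every bit this item maps to.
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, item):
        # True only if every bit for this item is set. Other items may
        # have set the same bits, so True means "probably seen".
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(item)
        )


seen = BloomFilter()
seen.add("adafruit.com")
print("adafruit.com" in seen)  # True: added items are always found
```

The memory used stays at 128 bytes here regardless of how many strings are added; adding more items only raises the chance of a false positive.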

Alongside Titus’ talk, there’s a great explanation of the HyperLogLog data structure from Doug Turnbull that’s worth checking out. Probabilistic data structures are a fascinating combination of computer science, mathematics, and statistics.
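For a rough feel of how a HyperLogLog estimates a count of unique items, here is a simplified Python sketch. It is not the code from either the talk or Doug Turnbull’s explanation; the class name, the parameter p, and the use of SHA-256 are assumptions for illustration, and the bias constant used is the usual approximation that only applies when there are at least 128 registers (p of 7 or more). Each item is hashed, the first p bits pick a register, and the register remembers the longest run of leading zeros seen in the remaining bits. Long runs are rare, so seeing one suggests many distinct items have passed through.

```python
import hashlib
import math


class HyperLogLog:
    """Illustrative HyperLogLog: estimates distinct items in fixed memory."""

    def __init__(self, p=10):
        # p is an example value (assumption); it gives 2**p registers.
        self.p = p
        self.m = 1 << p
        self.registers = [0] * self.m
        # Standard bias-correction approximation, valid for m >= 128.
        self.alpha = 0.7213 / (1 + 1.079 / self.m)

    def add(self, item):
        digest = hashlib.sha256(item.encode("utf-8")).digest()
        h = int.from_bytes(digest[:8], "big")  # 64-bit hash value
        index = h >> (64 - self.p)             # first p bits pick a register
        rest = h & ((1 << (64 - self.p)) - 1)  # remaining 64 - p bits
        # Rank = position of the leftmost 1-bit in the remaining bits.
        rank = (64 - self.p) - rest.bit_length() + 1
        self.registers[index] = max(self.registers[index], rank)

    def count(self):
        # Harmonic mean of 2**register across all registers, scaled.
        raw = self.alpha * self.m ** 2 / sum(2.0 ** -r for r in self.registers)
        zeros = self.registers.count(0)
        if raw <= 2.5 * self.m and zeros:
            # Small-range correction: fall back to linear counting.
            return self.m * math.log(self.m / zeros)
        return raw


hll = HyperLogLog()
for i in range(50000):
    hll.add(f"visitor-{i}")
print(round(hll.count()))  # an estimate near 50000, not an exact count
```

With 1024 registers the typical error is a few percent, and adding the same item again never changes the estimate, which is what makes the structure useful for counting unique values in a stream.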




