How A HyperLogLog And Other Probabilistic Data Structures Work
Titus Brown’s Awesome Big Data Algorithms talk from PyCon 2013 is a fascinating look at probabilistic data structures and is worth a watch if you’re interested in computer science. These sometimes mysterious structures with names like HyperLogLog and Bloom filter let you do seemingly impossible things like count more unique items than your computer has memory to store. This is possible by using statistics to estimate values rather than trying to store them in memory. This lets the structures scale to immense amounts of data, like what Google might process in a day crawling the entire internet!
In addition to Titus’ talk there’s a great explanation of the HyperLogLog data structure from Doug Turnbull that’s worth checking out too. Probabilistic data structures are a fascinating combination of computer science, mathematics, and statistics.
Stop breadboarding and soldering – start making immediately! Adafruit’s Circuit Playground is jam-packed with LEDs, sensors, buttons, alligator clip pads and more. Build projects with Circuit Playground in a few minutes with the drag-and-drop MakeCode programming site, learn computer science using the CS Discoveries class on code.org, jump into CircuitPython to learn Python and hardware together, or even use Arduino IDE. Circuit Playground Express is the newest and best Circuit Playground board, with support for MakeCode, CircuitPython, and Arduino. It has a powerful processor, 10 NeoPixels, mini speaker, InfraRed receive and transmit, two buttons, a switch, 14 alligator clip pads, and lots of sensors: capacitive touch, IR proximity, temperature, light, motion and sound. A whole wide world of electronics and coding is waiting for you, and it fits in the palm of your hand.