
Getting Started with Tokenization, Transformers and NLP #NLP #Tokenization #MachineLearning #Transformers @huggingface @MorganFunto

Screenshot of @huggingface Tweet announcing the release of several hands-on tutorials with tokenizers, transformers, and pipelines.


Earlier this month @huggingface released a number of notebooks that walk users through some NLP basics. The three-part series, written by @MorganFunto, covers tokenizers, transformers, and pipelines using Hugging Face’s transformers library. The notebooks cover the basics at a high level and get you working in the code quickly. They are written in Colab, which allows anyone to run the code in the browser. Here’s the intro from the tokenization notebook:

Before going deep into any Machine Learning or Deep Learning Natural Language Processing models, every practitioner should find a way to map raw input strings to a representation understandable by a trainable model. One very simple approach would be to split inputs over every space and assign an identifier to each word.
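The "very simple approach" from that intro can be sketched in a few lines of plain Python. This is only an illustration of splitting on whitespace and assigning an identifier to each word, not code from the notebook: the names `build_vocab` and `encode`, the toy corpus, and the use of -1 for unknown words are all assumptions made for this example.

```python
def build_vocab(texts):
    """Assign an integer identifier to each distinct whitespace-separated word."""
    vocab = {}
    for text in texts:
        for word in text.split():
            # New words get the next unused id, in order of first appearance.
            if word not in vocab:
                vocab[word] = len(vocab)
    return vocab


def encode(text, vocab, unk_id=-1):
    """Map a raw string to a list of ids; unseen words get unk_id (an assumption)."""
    return [vocab.get(word, unk_id) for word in text.split()]


# Toy corpus, made up for illustration.
corpus = ["the cat sat on the mat", "the dog sat"]
vocab = build_vocab(corpus)
ids = encode("the dog sat on the rug", vocab)

print(vocab)
print(ids)
```

The word "rug" never appears in the toy corpus, so it has no identifier of its own. Handling unseen words like this is one of the limitations of word-level splitting.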

The repo contains official notebooks provided by Hugging Face, and it also has a call for transformer notebooks from the community:

…we would like to list here interesting content created by the community. If you wrote some notebook(s) leveraging transformers and would like be listed here, please open a Pull Request and we’ll review it so it can be included here.

In addition to the three-part series described above, there are notebooks on “How to train a language model” and “How to generate text”. You can find more details about the transformers library in the Hugging Face repo or paper. You can also use a transformer to generate text in the browser with their “Write With Transformer” tool.


Written by Rebecca Minich, Product Analyst, Data Science at Google. Opinions expressed are solely my own and do not express the views or opinions of my employer.


