NumPy 2.0.0 Released, Mesop, Data Extraction from Unstructured PDFs and more
A Beautiful and Timely Python Multi-page Streamlit Application, Ruff: Internals of a Rust-backed Python linter-formatter, plus more interesting news, articles, packages and projects
News
Polars 1.0.0-alpha.1 released
Polars 1.0 will be released soon. Exciting times ahead.
NumPy 2.0.0 Released
This major release brings many new features, along with some breaking changes.
The Python Language Summit 2024: Lightning Talks
If you want to know what is coming to Python or what is happening in the core Python world, check out the Python Language Summit. The 2024 lightning talks covered “Rust in Python, Formalizing the PEP prototype process, improving asserts in 3.14, sharing data between sub-interpreters, and more”.
Malicious VSCode extensions with millions of installs discovered
Looks like bad actors are everywhere. This time it's VSCode. A group of Israeli researchers explored the security of the Visual Studio Code marketplace and managed to "infect" over 100 organizations by trojanizing a copy of the popular 'Dracula Official' theme to include risky code.
psutil 6.0.0 released
A major release after 7 years, packed with many new features. Be careful: some of the changes affect backward compatibility.
Articles
Mesop - Quickly build web UIs in Python
A new kid in town and an alternative to Streamlit. Mesop is a Python-based UI framework that lets you rapidly build web apps. It is used at Google for rapid internal app development.
Data Extraction from Unstructured PDFs
In this article, Ashish Choudhary explains how to extract data from PDF files using the PyMuPDF package and which other good packages exist for extracting data from PDFs. He also explains annot, which is similar to bounding boxes and rectangles.
# Ashish Choudhary
A Beautiful and Timely Python Multi-page Streamlit Application
Streamlit is an open-source app framework that helps to create interactive web applications with ease. John Loewen took historical summer Olympic data and showed how to build multiple data visualizations in this article.
# John Loewen
Ruff: Internals of a Rust-backed Python linter-formatter - Part 1
Abdur-Rahmaan Janhangeer goes to great lengths to explain what Ruff is and how it works. He covers how Ruff implements the rules of many existing packages from scratch, how Charlie Marsh started the project, and much more.
# Abdur-Rahmaan Janhangeer
Why I love using Cookiecutter Django
It’s a good idea to use Cookiecutter when you are starting a new project: it comes with best practices built in, along with the boilerplate code we usually write for every project, so it also saves time. Simeon Emanuilov covers these and several other reasons to use Cookiecutter Django in this article.
# Simeon Emanuilov
How to Setup Automatic Versioning for your FastAPI App
In this article, Amir Lavasani explains what semantic versioning is, why to use it, and how to set it up. He wrote it for an application built with the FastAPI framework, but it gives a pretty good idea of how to apply the same approach to an application in any framework.
# Amir Lavasani
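To make the idea of semantic versioning concrete, here is a minimal sketch of parsing and bumping a MAJOR.MINOR.PATCH version string using only the standard library. It is not the setup from the article; the names `Version`, `parse_version`, and `bump` are made up for illustration, and pre-release and build metadata are left out.

```python
import re
from typing import NamedTuple

# Assumption: plain MAJOR.MINOR.PATCH only, no pre-release or build metadata.
_VERSION_RE = re.compile(r"(0|[1-9]\d*)\.(0|[1-9]\d*)\.(0|[1-9]\d*)")


class Version(NamedTuple):
    """Hypothetical helper type; tuples compare field by field, which gives version ordering."""
    major: int
    minor: int
    patch: int

    def __str__(self) -> str:
        return f"{self.major}.{self.minor}.{self.patch}"


def parse_version(text: str) -> Version:
    """Parse a string such as '1.4.2' into a Version, or raise ValueError."""
    match = _VERSION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"not a MAJOR.MINOR.PATCH version: {text!r}")
    return Version(*(int(part) for part in match.groups()))


def bump(version: Version, part: str) -> Version:
    """Return a new Version with the given part incremented and lower parts reset to 0."""
    if part == "major":   # breaking change
        return Version(version.major + 1, 0, 0)
    if part == "minor":   # backward-compatible feature
        return Version(version.major, version.minor + 1, 0)
    if part == "patch":   # backward-compatible bug fix
        return Version(version.major, version.minor, version.patch + 1)
    raise ValueError(f"unknown part: {part!r}")


if __name__ == "__main__":
    current = parse_version("1.4.2")
    print(bump(current, "minor"))
```

Because `Version` is a tuple, comparisons such as `parse_version("1.10.0") > parse_version("1.9.0")` behave numerically, which plain string comparison would get wrong.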
10 Best Practices for Data Science
Benedict Neo lists 10 best practices for data science projects, including “Start Organized, Stay Organized, Version Control is Basic Professionalism, Tests and Sanity Checks Prevent Catastrophes and more”. These are gold. Understand them and follow them in your projects.
# Benedict Neo
Interesting Packages and Projects to explore
Pillow - Python Imaging Library (Fork)
python-slugify - A Python slugify application that handles unicode.
language-tool-python - A free Python grammar checker 📝✅
Tenacity - Retrying library for Python
PyInstaller - Freeze (package) Python programs into stand-alone executables
About Upcoming Python Events
PyRustLin: PyDelhi x Rust Delhi x ILUG-D Meetup
June 22, 2024
PyLadies Amsterdam: Introduction to LLM Agents with LangChain
June 25, 2024
North Bay Python 2024
June 29 - 30, 2024