Monorepo, Polars LazyFrames, Text Embeddings With Parquet and Polars and more
with some more interesting news, articles, packages and projects
News
Call for Proposals for DjangoCon Africa 2025 is now open!
SCons 4.9.0 Released
Python-oracledb 3.0 Released
PyCon US 2025 Schedule is out
Articles
Cracking the Python Monorepo
Monorepo is a popular way to organize codebases, and it is widely used by tech giants like Google, Facebook and Twitter. In this article, Daniel Gafni shared his approach to building boilerplate-free and highly efficient Dagger pipelines for Python monorepos managed with uv
workspaces.
# Daniel Gafni
The Best Way to Use Text Embeddings Portably is With Parquet and Polars
Text embeddings, especially those from large language models are very useful. While vector databases are often recommended for finding similar embeddings, Max Woolf shared a simpler approach using Parquet and Polars - perfect if you're working with a smaller dataset.
# Max Woolf
How to Work With Polars LazyFrames
If you're working with large datasets, using LazyFrame instead of DataFrame can be more efficient, as Lazy evaluation optimizes query plans before materializing data. However, that doesn’t mean LazyFrames are always the best choice. In this article, Ian Eyre explained how to create & use them, when to leverage lazy evaluation and when to switch between lazy and eager modes.
# Ian Eyre
AI Engineering Goes Visual: Building an LLM RAG with PyFlyde & LangChain
I love newsletters - of course, that’s why I write one! But wouldn’t it be great to have a search tool to quickly find relevant articles and their content? Building a RAG application makes a lot of sense if you're thinking about something similar. In this two-part article series, Vladimir Sibirov covered how to do just that. Plus, you'll get introduced to Flyde, a flow-based programming tool that visually maps out your project’s workflow.
# Vladimir Sibirov
The Dangers of Deceptive Data–Confusing Charts and Misleading Headlines
Recognizing deception takes some expertise, especially when misleading charts are used to confuse people. Having a solid understanding of these tactics can help you avoid being misled. In this article, Murtaza Ali explained how to spot deceptive visualizations and what to avoid.
# Murtaza Ali
Interesting Packages and Projects to explore
Granian - A Rust HTTP server for Python applications built on top of the Hyper crate
More Itertools - More routines for operating on iterables, beyond itertools
djoser - REST implementation of Django authentication system
FastStream - Effortless event stream integration for your services
InstructLab - InstructLab Command-Line Interface. Use this to chat with a model and execute the InstructLab workflow to train a model using custom taxonomy data
About Upcoming Python Events
Leipzig Python User Group Meeting
March 11, 2025
PythonCDMX 🐍 Mar 2025 - Agentes de IA / Pysides 6
March 11, 2025
pyCologne User Group Treffen
March 11, 2025
Python Ireland March 2025 Meetup
March 12, 2025
Python Atlanta Meetup
March 13, 2025
BangPypers - Bangalore Python Users Group - Build with AI
March 15, 2025