TL;DR

  • Faster POC to prod: As the LangChain documentation describes it, “LCEL is a declarative way to easily compose chains together. LCEL was designed from day 1 to support putting prototypes in production, with no code change”.

  • Custom chain creation: LCEL simplifies the process of creating custom chains with a new syntax.

  • Out of the box streaming and batch: LCEL gives you batch, streaming and async capabilities for free.

  • Unified interface: It offers automatic parallelization, typing capabilities, and any future feature LangChain might develop.

  • LCEL is the future of LangChain: LCEL provides a fresh perspective on LLM-based application development. I highly recommend using it for your next LLM project.

In less than a year, LangChain has become one of the most used Python libraries for interacting with LLMs, but it remained mostly a library for POCs, as it lacked the ability to create complex and scalable applications.
Everything changed in August 2023 when they released LangChain Expression Language (LCEL), a new syntax that bridges the gap from POC to production. This article will guide you through the ins and outs of LCEL, showing you how it simplifies the creation of custom chains and why you must learn it if you are building LLM applications!

Prompts, LLMs and chains: let’s refresh our memory

Before diving into the LCEL syntax, I think it is beneficial to refresh our memory on LangChain concepts such as LLM and Prompt or even a Chain.

LLM: In LangChain, the LLM is an abstraction around the model used to generate completions, such as OpenAI’s GPT-3.5, Anthropic’s Claude, etc.

Prompt: This is the input of the LLM object; it asks the LLM questions and sets its objectives.

Chain: This refers to a sequence of calls to an LLM, or any data processing step.

Now that definitions are out of the way, let’s suppose we want to create a company! We need a really cool and catchy name and a business model to make some money!

Example — Company name & Business model with Old Chains

from langchain.chains import LLMChain
from langchain.prompts import PromptTemplate
from langchain_community.llms import OpenAI

USER_INPUT = "colorful socks"
llm = OpenAI(temperature=0)

prompt_template_product = "What is a good name for a company that makes {product}?"
company_name_chain = LLMChain(llm=llm, prompt=PromptTemplate.from_template(prompt_template_product))
company_name_output = company_name_chain(USER_INPUT)

prompt_template_business = "Give me the best business model idea for my company named: {company}"
business_model_chain = LLMChain(llm=llm, prompt=PromptTemplate.from_template(prompt_template_business))
business_model_output = business_model_chain(company_name_output["text"])

print(company_name_output)
print(business_model_output)

>>> {'product': 'colorful socks', 'text': 'Socktastic!'}
>>> {'company': 'Socktastic!', 'text': "A subscription-based service offering a monthly delivery…"}

This is quite easy to follow: there is a bit of redundancy, but it is manageable.

Let’s add some customization by handling the cases where the user does not use our chain as expected.
Maybe the user will input something completely unrelated to the goal of our chain? In that case, we want to detect it and respond appropriately.

Example — Customization & Routing with Old Chains

from langchain.chains import LLMChain
from langchain.prompts import PromptTemplate
from langchain_community.llms import OpenAI
import ast

USER_INPUT = "Harrison Chase"
llm = OpenAI(temperature=0)

# ---- Same code as before
prompt_template_product = "What is a good name for a company that makes {product}?"
company_name_chain = LLMChain(llm=llm, prompt=PromptTemplate.from_template(prompt_template_product))

prompt_template_business = "Give me the best business model idea for my company named: {company}"
business_model_chain = LLMChain(llm=llm, prompt=PromptTemplate.from_template(prompt_template_business))

# ---- New code

prompt_template_is_product = (
    "Your goal is to find if the input of the user is a plausible product name\n"
    "Questions, greetings, long sentences, celebrities or other non relevant inputs are not considered products\n"
    "input: {product}\n"
    "Answer only by 'True' or 'False' and nothing more\n"
)

prompt_template_cannot_respond = (
    "You cannot respond to the user input: {product}\n"
    "Ask the user to input the name of a product in order for you to make a company out of it.\n"
)

cannot_respond_chain = LLMChain(llm=llm, prompt=PromptTemplate.from_template(prompt_template_cannot_respond))
is_a_product_chain = LLMChain(llm=llm, prompt=PromptTemplate.from_template(prompt_template_is_product))

# If we use bool() on a non-empty str it will be True, so we need `literal_eval`
is_a_product = ast.literal_eval(is_a_product_chain(USER_INPUT)["text"])
if is_a_product:
    company_name_output = company_name_chain(USER_INPUT)
    business_model_output = business_model_chain(company_name_output["text"])
    print(business_model_output)
else:
    print(cannot_respond_chain(USER_INPUT))

This becomes a bit harder to understand. Let’s summarize:

  • We created a new chain, is_a_product_chain(), that detects whether the user input can be considered a product.

  • We implemented if/else conditions to branch between the chains.

There are multiple problems that start to arise:

  • The code is a bit redundant as there is a lot of boilerplate.

  • It’s hard to tell which LLMChain is linked to which; we need to trace the inputs and outputs to understand it.

  • We can easily make errors on the output types of the chains. For example, the output of is_a_product_chain() is a str that must later be evaluated as a bool.

What is LangChain Expression Language (LCEL)?

LCEL is a unified interface and syntax for writing composable, production-ready chains; there is a lot to unpack to understand what that means.

We will first try to understand the new syntax by rewriting the chain from earlier.

Example — Company name & Business model with LCEL

from langchain_core.runnables import RunnablePassthrough
from langchain.prompts import PromptTemplate
from langchain_community.llms import OpenAI

USER_INPUT = "colorful socks"
llm = OpenAI(temperature=0)

prompt_template_product = "What is a good name for a company that makes {product}?"
prompt_template_business = "Give me the best business model idea for my company named: {company}"

chain = (
    PromptTemplate.from_template(prompt_template_product)
    | llm
    | {"company": RunnablePassthrough()}
    | PromptTemplate.from_template(prompt_template_business)
    | llm
)

business_model_output = chain.invoke({"product": USER_INPUT})

A lot of unusual code in just a few lines:

  • There is a weird | operator between a PromptTemplate, an llm and a dictionary?! The | operator is simply here to say: “take the dictionary on the left and pass it as input to the object on the right”.

  • Why are we passing the variable product inside a dictionary instead of a string like before? If you’ve read #1, you know that the | operator expects its input as a dictionary, therefore we pass the product argument inside a dictionary.

  • Why is there a function named RunnablePassthrough() instead of the company name? The RunnablePassthrough() is a placeholder to say: “we don’t have the company name for now, but when we have it, place it here” (see the sketch after this list). I’ll explain what the “Runnable” term means in the next parts; for now it is OK to ignore it.

  • Why do we need a specific method .invoke() instead of writing chain({'product': 'colorful socks'})? We will understand this in the next part, but it is a sneak peek at why LCEL makes industrialization easier!
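
To make the RunnablePassthrough() placeholder concrete, here is a minimal sketch (using only langchain_core, no API key needed) of what the {'company': RunnablePassthrough()} step does with the raw string produced by the llm:

from langchain_core.runnables import RunnableParallel, RunnablePassthrough

# A dict in an LCEL chain is converted to a RunnableParallel: each value
# receives the full input, and the results are gathered under the dict's keys.
step = RunnableParallel(company=RunnablePassthrough())
print(step.invoke("Socktastic!"))

>>> {'company': 'Socktastic!'}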

But is it really more composable to create chains this way?
Let’s put it to the test by adding the is_a_product_chain() and the branching for when the user input is not as expected. We can even type the chain with Python typing; let’s do this as good practice.

Example — Customization & Routing with LCEL

from typing import Dict
from langchain_core.runnables import RunnablePassthrough, RunnableBranch
from langchain.prompts import PromptTemplate
from langchain.output_parsers import BooleanOutputParser
from langchain_community.llms import OpenAI

USER_INPUT = "Harrison Chase"
llm = OpenAI(temperature=0)

prompt_template_product = "What is a good name for a company that makes {product}?"
prompt_template_cannot_respond = (
    "You cannot respond to the user input: {product}\n"
    "Ask the user to input the name of a product in order for you to make a company out of it.\n"
)
prompt_template_business = "Give me the best business model idea for my company named: {company}"
prompt_template_is_product = (
    "Your goal is to find if the input of the user is a plausible product name\n"
    "Questions, greetings, long sentences, celebrities or other non relevant inputs are not considered products\n"
    "input: {product}\n"
    "Answer only by 'True' or 'False' and nothing more\n"
)

answer_user_chain = (
    PromptTemplate.from_template(prompt_template_product)
    | llm
    | {"company": RunnablePassthrough()}
    | PromptTemplate.from_template(prompt_template_business)
    | llm
).with_types(input_type=Dict[str, str], output_type=str)

is_product_chain = (
    PromptTemplate.from_template(prompt_template_is_product)
    | llm
    | BooleanOutputParser(true_val="True", false_val="False")
).with_types(input_type=Dict[str, str], output_type=bool)

cannot_respond_chain = (
    PromptTemplate.from_template(prompt_template_cannot_respond) | llm
).with_types(input_type=Dict[str, str], output_type=str)

full_chain = RunnableBranch(
    (is_product_chain, answer_user_chain),
    cannot_respond_chain,
).with_types(input_type=Dict[str, str], output_type=str)

print(full_chain.invoke({"product": USER_INPUT}))

Let’s list the differences:

  • The syntax is different, ok that’s a given.

  • There are intermediary chains defined and called in a bigger chain, almost like functions.

  • Inputs and outputs are typed, almost like functions.

  • This doesn’t feel like Python.

Why is LCEL better for industrialization?

If I had been reading this article up to this exact point and someone asked me whether I was convinced by LCEL, I would probably say no. The syntax is too different, and I could probably organize my code into functions to get almost exactly the same result. But I’m here, writing this article, so there must be something more.

Out of the box invoke, stream and batch

By using LCEL your chain automatically has:

  • .invoke(): You want to pass your input and get the output, nothing more, nothing less.

  • .batch(): You want to pass multiple inputs to obtain multiple outputs, the parallelization is handled for you (faster than calling invoke 3 times).

  • .stream(): This allows you to start printing the beginning of the completion before the full completion is finished.

# `prompt` and `llm` are assumed to be defined as in the earlier examples
my_chain = prompt | llm

# --------- invoke --------- #
result_with_invoke = my_chain.invoke("hello world!")

# --------- batch --------- #
result_with_batch = my_chain.batch(["hello", "world", "!"])

# --------- stream --------- #
for chunk in my_chain.stream("hello world!"):
    print(chunk, flush=True, end="")

While iterating, you can use the invoke method to ease the development process. But when showing the output of your chain in a UI, you will want to stream the response. You can now use the stream method without rewriting anything.

Out of the box async methods

Most of the time, the frontend and backend of your application will be separated, meaning the frontend will make requests to the backend. If you have multiple users, you might need to handle multiple requests on your backend at the same time.

Since most of the code in LangChain is just waiting between API calls, we can leverage asynchronous code to improve API scalability. If you want to understand why this is important, I recommend reading the Concurrent Burgers story in the FastAPI documentation.
There is no need to worry about the implementation, because async methods are already available if you use LCEL:

.ainvoke() / .abatch() / .astream(): asynchronous versions of invoke, batch and stream.
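
As a minimal sketch, reusing the my_chain = prompt | llm chain from the previous snippet, the async variants are simply awaited inside a coroutine:

import asyncio

async def main():
    # async invoke: one input, one output
    result = await my_chain.ainvoke("hello world!")
    # async batch: several inputs processed concurrently
    results = await my_chain.abatch(["hello", "world", "!"])
    # async stream: print the completion as it is generated
    async for chunk in my_chain.astream("hello world!"):
        print(chunk, flush=True, end="")

asyncio.run(main())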

I also recommend reading the Why use LCEL page from LangChain documentation with examples for each sync / async method.

LangChain achieved those “out of the box” features by creating a unified interface called “Runnable”. Now, to leverage LCEL fully, we need to dive into what this new Runnable interface is.

The Runnable interface

Every object we’ve used in the LCEL syntax so far is a Runnable. This is a Python object created by LangChain that automatically inherits every feature we talked about before, and a lot more. By using the LCEL syntax, we compose a new Runnable at each step, meaning that the final object created will also be a Runnable. You can learn more about the interface in the official documentation.

All the objects in the code below are either Runnables or dictionaries that are automatically converted to a Runnable:

from langchain_core.runnables import RunnablePassthrough, RunnableParallel
from langchain.prompts import PromptTemplate
from langchain_community.llms import OpenAI

# `prompt_template_product`, `prompt_template_business` and `llm`
# are defined as in the earlier examples

chain_number_one = (
    PromptTemplate.from_template(prompt_template_product)
    | llm
    | {"company": RunnablePassthrough()}  # <— THIS WILL CHANGE
    | PromptTemplate.from_template(prompt_template_business)
    | llm
)

chain_number_two = (
    PromptTemplate.from_template(prompt_template_product)
    | llm
    | RunnableParallel(company=RunnablePassthrough())  # <— THIS CHANGED
    | PromptTemplate.from_template(prompt_template_business)
    | llm
)

print(chain_number_one == chain_number_two)
>>> True

Why do we use RunnableParallel() and not simply Runnable()?

Because every Runnable inside a RunnableParallel is executed in parallel. This means that if you have 3 independent steps in your Runnable, they will run at the same time on different threads of your machine, improving the speed of your chain for free!
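
Here is a minimal toy sketch (no LLM calls, only langchain_core) showing three independent steps gathered in a RunnableParallel:

from langchain_core.runnables import RunnableLambda, RunnableParallel

# Three independent transformations of the same input...
upper = RunnableLambda(lambda x: x.upper())
reverse = RunnableLambda(lambda x: x[::-1])
length = RunnableLambda(lambda x: len(x))

# ...executed concurrently, with the results gathered under the given keys.
parallel = RunnableParallel(upper=upper, reversed=reverse, length=length)
print(parallel.invoke("colorful socks"))

>>> {'upper': 'COLORFUL SOCKS', 'reversed': 'skcos lufroloc', 'length': 14}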

Drawbacks of LCEL

Despite its advantages, LCEL does have some potential drawbacks:

  • Not fully PEP compliant: LCEL does not fully respect PEP 20, the Zen of Python, which states that “explicit is better than implicit”. (To check PEP 20 you can run import this in Python.) Additionally, LCEL’s syntax is not considered “Pythonic”, as it feels like a different language; this could make LCEL less intuitive for some Python developers, who might refuse to use it.

  • LCEL is a Domain-Specific Language (DSL): Users are expected to have some understanding of prompts, chains or LLMs in order to leverage the syntax efficiently.

  • Input / Output dependencies: Intermediary inputs and final outputs must be passed down from the start to the end. For instance, if you want to use the output of an intermediate step as the final output, you must carry it through all subsequent steps, as the sketch after this list illustrates. This can lead to extra arguments in most of your chains, which may not be used but are necessary if you want to access them through the output.
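
To illustrate the carrying pattern, here is a hedged sketch (reusing the prompt templates and llm from the earlier examples) that exposes the intermediate company name in the final output with RunnablePassthrough.assign:

from langchain_core.runnables import RunnablePassthrough
from langchain.prompts import PromptTemplate

# To expose the intermediate company name in the final output,
# it must be threaded through every later step.
chain_with_company = (
    PromptTemplate.from_template(prompt_template_product)
    | llm
    | {"company": RunnablePassthrough()}
    | RunnablePassthrough.assign(
        business_model=PromptTemplate.from_template(prompt_template_business) | llm
    )
)
# chain_with_company.invoke({"product": "colorful socks"}) now returns both
# keys, e.g. {'company': 'Socktastic!', 'business_model': '...'}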

Conclusion

In conclusion, LangChain Expression Language (LCEL) is a powerful tool that brings a fresh perspective to building LLM applications in Python. Despite its unconventional syntax, I highly recommend using LCEL for the following reasons:

  • Unified Interface: Provides a consistent interface for all chains, making it easier to industrialize your code with out of the box stream, async, fallback models, typing, runtime configurations, etc. (see the sketch after this list).

  • Automatic Parallelization: Automatically run multiple tasks in parallel, improving the execution speed of your chains and improving the user experience.

  • Composability: It allows you to easily compose and modify chains, making your code more flexible and adaptable.
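
As a taste of those industrialization features, here is a hedged sketch of fallback models using with_fallbacks, a method every Runnable exposes (the model names are illustrative):

from langchain.prompts import PromptTemplate
from langchain_community.llms import OpenAI

prompt = PromptTemplate.from_template("What is a good name for a company that makes {product}?")
primary_llm = OpenAI(model="gpt-3.5-turbo-instruct", temperature=0)
fallback_llm = OpenAI(model="babbage-002", temperature=0)

# If the primary chain raises an error (rate limit, outage, ...),
# the fallback chain is transparently tried instead.
robust_chain = (prompt | primary_llm).with_fallbacks([prompt | fallback_llm])
print(robust_chain.invoke({"product": "colorful socks"}))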

To Go Further…

The Runnable abstraction

In some cases, I believe it’s important to understand the abstraction that LangChain has implemented to make the LCEL syntax work.

You can re-implement the basic functionality of a Runnable easily, as follows:

class Runnable:
    def __init__(self, func):
        self.func = func

    def __or__(self, other):
        def chained_func(*args, **kwargs):
            # self.func is on the left, other is on the right
            return other(self.func(*args, **kwargs))
        return Runnable(chained_func)

    def __call__(self, *args, **kwargs):
        return self.func(*args, **kwargs)

def add_ten(x):
    return x + 10

def divide_by_two(x):
    return x / 2

runnable_add_ten = Runnable(add_ten)
runnable_divide_by_two = Runnable(divide_by_two)
chain = runnable_add_ten | runnable_divide_by_two
result = chain(8)  # (8+10) / 2 = 9.0 should be the answer
print(result)
>>> 9.0

A Runnable is simply a Python object in which the .__or__() method has been overridden.
In practice, LangChain has added a lot of functionality on top of this, such as converting dictionaries to Runnables, typing capabilities, configurability, and the invoke, batch, stream and async methods!
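
For instance, here is a hedged sketch (not LangChain’s actual implementation) of how the dictionary-to-Runnable conversion could be bolted onto our toy class:

# When the right-hand side of | is a dict of callables, apply each one to the
# input and gather the results under the dict's keys (mimics RunnableParallel).
def coerce(obj):
    if isinstance(obj, dict):
        return Runnable(lambda x: {key: func(x) for key, func in obj.items()})
    return obj

chain = runnable_add_ten | coerce({"half": divide_by_two, "same": lambda x: x})
print(chain(8))

>>> {'half': 9.0, 'same': 18}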

So, why not give LCEL a try in your next project?

If you want to learn more, I highly recommend browsing the LangChain cookbook on LCEL.

This article was initially published on Medium.com.