Open Source · backend
Use Llama 3.1 405B with Flask
Updated 2026-05-21 · By XALEN
How to use Llama 3.1 405B (Open Source, 405B) with Flask. Install, authenticate, and make your first API call in minutes. Working code example included.
Model
Llama 3.1 405B
405B · 128K context · $0.08 input
Framework
Flask
backend · pip install xalen flask
1. Install
pip install xalen flask
2. Code
from flask import Flask, request, jsonify
from xalen import XALEN
app = Flask(__name__)
client = XALEN(api_key="xln_test_YOUR_KEY")
@app.route("/chat", methods=["POST"])
def chat():
data = request.json
response = client.chat.completions.create(
model="llama-3-1-405b",
messages=data["messages"]
)
return jsonify({"reply": response.choices[0].message.content})
if __name__ == "__main__":
app.run()
Llama 3.1 405B with Other Frameworks
Other Models with Flask
200+ models. One API. Works with any framework.
Get API KeyLast updated: 2026-05-21