Compact · backend
Use Llama 3.1 8B Turbo with Flask
Updated 2026-05-21 · By XALEN
This guide shows how to use Llama 3.1 8B Turbo (Compact, 8B) with Flask: install the packages, authenticate with your API key, and make your first API call. A working code example is included.
Model
Llama 3.1 8B Turbo
8B · 128K context · $0.01 input
Framework
Flask
backend · pip install xalen flask
1. Install
pip install xalen flask
2. Code
from flask import Flask, request, jsonify
from xalen import XALEN

app = Flask(__name__)
client = XALEN(api_key="xln_test_YOUR_KEY")

@app.route("/chat", methods=["POST"])
def chat():
    data = request.json
    response = client.chat.completions.create(
        model="llama-3-1-8b-turbo",
        messages=data["messages"]
    )
    return jsonify({"reply": response.choices[0].message.content})

if __name__ == "__main__":
    app.run()
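The handler above indexes data["messages"] directly, so a request with a missing or malformed body would likely raise an exception and surface as a 500 error. Below is a minimal sketch of a validation step that could run before the model call. It assumes the endpoint expects chat-style messages (a list of objects with string "role" and "content" fields), which this page does not spell out. The helper name validate_messages is hypothetical and is not part of Flask or the XALEN SDK.

```python
def validate_messages(payload):
    """Check a decoded JSON body; return (messages, None) or (None, error)."""
    # The body must decode to a JSON object (a dict in Python).
    if not isinstance(payload, dict):
        return None, "request body must be a JSON object"

    # Assumption: the endpoint expects a non-empty list under "messages".
    messages = payload.get("messages")
    if not isinstance(messages, list) or not messages:
        return None, "'messages' must be a non-empty list"

    # Assumption: each message carries string "role" and "content" fields.
    for i, msg in enumerate(messages):
        if not isinstance(msg, dict):
            return None, f"messages[{i}] must be an object"
        role_ok = isinstance(msg.get("role"), str)
        content_ok = isinstance(msg.get("content"), str)
        if not (role_ok and content_ok):
            return None, f"messages[{i}] needs string 'role' and 'content'"

    return messages, None
```

One way to use it inside chat() is to read the body with request.get_json(silent=True), which returns None instead of raising when the body is not valid JSON, pass the result to validate_messages, and return jsonify({"error": error}), 400 when an error string comes back.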