rune/NewsAgent

Fork 0

Files

Rune Olsen f8684077a2 email testing

2026-01-26 12:55:56 +01:00

4.2 KiB

Raw Blame History

OpenRouter Model Reference

Quick reference for choosing the right AI model for News Agent.

Recommended Models

Best Overall: `google/gemini-flash-1.5-8b`

Cost: ~$0.05-0.15/day
Quality: Excellent
Speed: Very fast
Best for: Production use, daily digests

Best Free: `google/gemini-2.0-flash-exp:free`

Cost: FREE
Quality: Good (experimental)
Speed: Fast
Best for: Testing, low-budget setups
Note: Experimental, may change

Budget Option: `meta-llama/llama-3.1-8b-instruct:free`

Cost: FREE
Quality: Decent
Speed: Moderate
Best for: Testing, development

High Quality: `anthropic/claude-3.5-haiku`

Cost: ~$0.10-0.25/day
Quality: Excellent
Speed: Fast
Best for: When quality matters most

OpenAI Option: `openai/gpt-4o-mini`

Cost: ~$0.08-0.20/day
Quality: Very good
Speed: Fast
Best for: Balanced quality/cost

How to Change Model

Edit config.yaml:

ai:
  model: "google/gemini-flash-1.5-8b"  # Change this line

Full Model List

See all available models at: https://openrouter.ai/models

Filter by:

Free models - Search for :free suffix
Context length - Important for long articles
Supported features - JSON mode, function calling, etc.

Cost Comparison (Daily Estimates)

Based on processing ~50 articles/day with 15 summaries:

Model	Daily Cost	Monthly Cost	Quality
`google/gemini-2.0-flash-exp:free`	$0.00	$0.00	Good
`meta-llama/llama-3.1-8b-instruct:free`	$0.00	$0.00	Decent
`google/gemini-flash-1.5-8b`	$0.05-0.15	$1.50-4.50	Excellent
`openai/gpt-4o-mini`	$0.08-0.20	$2.40-6.00	Very Good
`anthropic/claude-3.5-haiku`	$0.10-0.25	$3.00-7.50	Excellent
`openai/gpt-4o`	$0.50-1.50	$15-45	Outstanding

Costs vary based on article length and quantity

Testing a New Model

Update config.yaml:
```
ai:
  model: "new-model-name"
```

Run a test:

source .venv/bin/activate
python -m src.main

Check quality of summaries in the email
Monitor costs at: https://openrouter.ai/activity

Model Selection Tips

Choose FREE models if:

You're testing the system
Cost is a major concern
Article quality/accuracy is less critical

Choose PAID models if:

You want best quality summaries
You need reliable daily digests
You value accurate relevance scoring

Performance vs Cost:

Gemini Flash 1.5-8b - Best balance
Claude Haiku - Best quality for cost
GPT-4o-mini - Good all-rounder
Free models - Testing/development

Troubleshooting

Error: "No endpoints found for [model]"

The model name is incorrect or not available on OpenRouter.

Solution:

Check model name at: https://openrouter.ai/models
Copy exact model ID (e.g., google/gemini-flash-1.5-8b)
Update config.yaml
Restart the service

Model Seems Slow

Some models are slower than others.

Solutions:

Try google/gemini-flash-1.5-8b (fastest)
Reduce max_articles in config (process fewer articles)
Use cheaper/faster models for filtering, premium for summarization

Poor Summary Quality

The model might not be good at summarization.

Solutions:

Try anthropic/claude-3.5-haiku (excellent summarization)
Adjust prompts in src/ai/prompts.py
Increase temperature for more creative summaries
Try different models

High Costs

You're using an expensive model or processing too many articles.

Solutions:

Switch to cheaper model (Gemini Flash or free options)
Increase min_score to filter more aggressively
Reduce max_articles limit
Monitor usage: https://openrouter.ai/activity

Advanced: Using Different Models for Different Tasks

You could modify the code to use:

Fast/cheap model for filtering (scoring articles)
High-quality model for summarization

Edit src/main.py to instantiate different clients for different tasks.

Need Help?

Model pricing: https://openrouter.ai/models (click model for details)
API docs: https://openrouter.ai/docs
Check costs: https://openrouter.ai/activity
Model comparison: Test different models and compare results

4.2 KiB Raw Blame History

OpenRouter Model Reference

Recommended Models

Best Overall: google/gemini-flash-1.5-8b

Best Free: google/gemini-2.0-flash-exp:free

Budget Option: meta-llama/llama-3.1-8b-instruct:free

High Quality: anthropic/claude-3.5-haiku

OpenAI Option: openai/gpt-4o-mini

How to Change Model

Full Model List

Cost Comparison (Daily Estimates)

Testing a New Model

Model Selection Tips

Choose FREE models if:

Choose PAID models if:

Performance vs Cost:

Troubleshooting

Error: "No endpoints found for [model]"

Model Seems Slow

Poor Summary Quality

High Costs

Advanced: Using Different Models for Different Tasks

Need Help?

4.2 KiB

Raw Blame History

Best Overall: `google/gemini-flash-1.5-8b`

Best Free: `google/gemini-2.0-flash-exp:free`

Budget Option: `meta-llama/llama-3.1-8b-instruct:free`

High Quality: `anthropic/claude-3.5-haiku`

OpenAI Option: `openai/gpt-4o-mini`