Brainum journal

Blog

Insights and analysis on enterprise AI

LLM, Model Compression, Research

1.6× Faster LLM Inference — And 13% Better Quality: Our ASVD + LoRA Compression Research

We ran 8 experiments compressing GPT-2 Large with activation-aware SVD (ASVD) and LoRA recovery. The result: a 1.6× inference speedup, with quality 13% better than the uncompressed baseline. Here is what we found and why it matters for enterprise AI deployments.

25 May 2026

14 min read

Mohamed EL HARCHAOUI

Agentic AI, MCP, Architecture

Skills vs MCP: When to Use Which?

As AI assistants take hold in the enterprise, two extension patterns dominate modern architectures: Skills and MCP servers. They seem similar, but serve very different purposes. Here's how to choose.

12 March 2026

5 min read

Mohamed EL HARCHAOUI

Agentic AI, Architecture, LLM

Agentic Workflows vs Autonomous Agents: What's the Difference?

These two terms are everywhere in AI discussions, often used interchangeably. Yet they describe very different realities — and picking the wrong approach for your use case can be costly.

10 March 2026

4 min read

Mohamed EL HARCHAOUI
