All articles

research
May 12, 2026
Hosting Qwen on Blackwell

May 6, 2026
CuTeDSL at Perplexity

research
May 1, 2026
Designing, Refining, and Maintaining Agent Skills at Perplexity

research
Apr 22, 2026
Advancing Search-Augmented Language Models

Feb 26, 2026
pplx-embed: State-of-the-Art Embedding Models for Web-Scale Retrieval
Today we are releasing pplx-embed-v1 and pplx-embed-context-v1, two state-of-the-art text embedding models built for real-world, web-scale retrieval.

research
Feb 4, 2026
Evaluating Deep Research Performance in the Wild with the DRACO Benchmark
DRACO: a Cross-Domain Benchmark for Deep Research Accuracy, Completeness, and Objectivity

security
Dec 2, 2025
BrowseSafe: Understanding and Preventing Prompt Injection Within AI Browser Agents
Defense architecture, benchmark, and detection model for securing AI agents in open-world web environments.

systems
Nov 5, 2025
RDMA Point-to-Point Communication for LLM Systems
Elegant tool to address emerging LLM communication patterns

systems
Nov 4, 2025
Enabling Trillion-Parameter Models on AWS EFA
Make trillion-parameter models available with cloud platform portability
Load more

research
May 12, 2026
Hosting Qwen on Blackwell

May 6, 2026
CuTeDSL at Perplexity

research
May 1, 2026
Designing, Refining, and Maintaining Agent Skills at Perplexity

research
Apr 22, 2026
Advancing Search-Augmented Language Models

Feb 26, 2026
pplx-embed: State-of-the-Art Embedding Models for Web-Scale Retrieval
Today we are releasing pplx-embed-v1 and pplx-embed-context-v1, two state-of-the-art text embedding models built for real-world, web-scale retrieval.

research
Feb 4, 2026
Evaluating Deep Research Performance in the Wild with the DRACO Benchmark
DRACO: a Cross-Domain Benchmark for Deep Research Accuracy, Completeness, and Objectivity

security
Dec 2, 2025
BrowseSafe: Understanding and Preventing Prompt Injection Within AI Browser Agents
Defense architecture, benchmark, and detection model for securing AI agents in open-world web environments.

systems
Nov 5, 2025
RDMA Point-to-Point Communication for LLM Systems
Elegant tool to address emerging LLM communication patterns
Load more

research
May 12, 2026
Hosting Qwen on Blackwell

May 6, 2026
CuTeDSL at Perplexity

research
May 1, 2026
Designing, Refining, and Maintaining Agent Skills at Perplexity

research
Apr 22, 2026
Advancing Search-Augmented Language Models

Feb 26, 2026
pplx-embed: State-of-the-Art Embedding Models for Web-Scale Retrieval
Today we are releasing pplx-embed-v1 and pplx-embed-context-v1, two state-of-the-art text embedding models built for real-world, web-scale retrieval.

research
Feb 4, 2026
Evaluating Deep Research Performance in the Wild with the DRACO Benchmark
DRACO: a Cross-Domain Benchmark for Deep Research Accuracy, Completeness, and Objectivity
Load more