Blog

Agentic Context Engineering

alt ACE

Large language model (LLM) applications such as agents and domain-specific reasoning increasingly rely on context adaptation—modifying inputs with instructions, strategies, or evidence, rather than weight updates. Prior approaches improve usability but often suffer from brevity bias, which drops domain insights for concise summaries, and from context collapse, where iterative rewriting erodes details over time. Building on the adaptive memory introduced by Dynamic Cheatsheet, we introduce ACE (Agentic Context Engineering), a framework that treats contexts as evolving playbooks that accumulate, refine, and organize strategies through a modular process of generation, reflection, and curation. ACE prevents collapse with structured, incremental updates that preserve detailed knowledge and scale with long-context models. Across agent and domain-specific benchmarks, ACE optimizes contexts both offline (e.g., system prompts) and online (e.g., agent memory), consistently outperforming strong baselines: +10.6% on agents and +8.6% on finance, while significantly reducing adaptation latency and rollout cost. Notably, ACE could adapt effectively without labeled supervision and instead by leveraging natural execution feedback. On the AppWorld leaderboard, ACE matches the top-ranked production-level agent on the overall average and surpasses it on the harder test-challenge split, despite using a smaller open-source model. These results show that comprehensive, evolving contexts enable scalable, efficient, and self-improving LLM systems with low overhead

Kualitas Pelajar Dipengaruhi Sekolah atau Pelajarnya

Hasil kualitas pendidikan apakah ditentukan oleh sekolah atau pelajarnya? Di negara maju, kualitas lebih banyak ditentukan oleh kualitas pelajarnya. Di negara berkembang, kualitas sangat dipengaruhi oleh kualitas sekolahnya. Argumennya adalah kualitas pendidikan di negara berkembang umumnya tidak merata, sehingga sangat berpengaruh pada kualitas anak didik. Fenomena ini dikenal sebagai Heyneman-Loxley effect".

Literatur #

Berikut ini paper tentang Heyneyman-Loxley effect: “The role of socioeconomic status and school quality in the Philippines: Revisiting the Heyneman–Loxley effect”

Shannon Autonomous Pentester

Shannon adalah perangkat lunak pentester berbasis Artificial Intelligence.

Fitur utama:

  • Beroperasi secara otonom, tidak perlu manual
  • Laporan pentester dengan exploit yang dapat direproduksi
  • Critical OWASP Vulnerability Coverage
  • Code-Aware Dynamic Testing
  • Powered by Integrated Security Tools
  • Parallel Processing for Faster Results

alt Shannon Screen

Sumber: #

Professional Software Developers Don't Vibe, They Control: AI Agent Use for Coding in 2025

The rise of AI agents is transforming how software can be built. The promise of agents is that developers might write code quicker, delegate multiple tasks to different agents, and even write a full piece of software purely out of natural language. In reality, what roles agents play in professional software development remains in question. This paper investigates how experienced developers use agents in building software, including their motivations, strategies, task suitability, and sentiments. Through field observations (N=13) and qualitative surveys (N=99), we find that while experienced developers value agents as a productivity boost, they retain their agency in software design and implementation out of insistence on fundamental software quality attributes, employing strategies for controlling agent behavior leveraging their expertise. In addition, experienced developers feel overall positive about incorporating agents into software development given their confidence in complementing the agents’ limitations. Our results shed light on the value of software development best practices in effective use of agents, suggest the kinds of tasks for which agents may be suitable, and point towards future opportunities for better agentic interfaces and agentic use guidelines.

Spec Kit

Spec Kit is An open source toolkit that allows you to focus on product scenarios and predictable outcomes instead of vibe coding every piece from scratch.

Spec-Driven Development means specifications become executable, directly generating working implementations rather than just guiding them.

Source:

Geospatial Segmentation

Open LLM

Beberapa model AI LLM open:

  • Writing: Kimi k2 / Thinking
  • Coding: Minimax M2 / GLM 4.6
  • OCR: DeepSeek / Qwen 3 VL
  • General: DeepSeek V3.2
  • Image: Flux 2 Dev / Z-Image
  • Reasoning : DeepSeek v3.2 speciale

Metode akses:

  • Writing : Official Chat UI Kimi
  • Coding : Zed / KiloCode
  • OCR : Official Chat / via API
  • General : Official Chat Deepsek
  • Reasoning : Official / via API
  • Image Editing dan Image : hugging face, ModelScope