Blog | Ilmu Pengetahuan & Teknologi

Agentic Context Engineering

December 17, 2025

alt ACE

Large language model (LLM) applications such as agents and domain-specific reasoning increasingly rely on context adaptation—modifying inputs with instructions, strategies, or evidence, rather than weight updates. Prior approaches improve usability but often suffer from brevity bias, which drops domain insights for concise summaries, and from context collapse, where iterative rewriting erodes details over time. Building on the adaptive memory introduced by Dynamic Cheatsheet, we introduce ACE (Agentic Context Engineering), a framework that treats contexts as evolving playbooks that accumulate, refine, and organize strategies through a modular process of generation, reflection, and curation. ACE prevents collapse with structured, incremental updates that preserve detailed knowledge and scale with long-context models. Across agent and domain-specific benchmarks, ACE optimizes contexts both offline (e.g., system prompts) and online (e.g., agent memory), consistently outperforming strong baselines: +10.6% on agents and +8.6% on finance, while significantly reducing adaptation latency and rollout cost. Notably, ACE could adapt effectively without labeled supervision and instead by leveraging natural execution feedback. On the AppWorld leaderboard, ACE matches the top-ranked production-level agent on the overall average and surpasses it on the harder test-challenge split, despite using a smaller open-source model. These results show that comprehensive, evolving contexts enable scalable, efficient, and self-improving LLM systems with low overhead

ARTEMIS, hacker dengan artificial intelligence

December 17, 2025

Artificial Intelligence, Cybersecurity

Stanford berhasil membuat hacker berbasis Artificial Intelligence bernama ARTEMIS. Software hacker ini berhasil membobol sebagian aplikasi di jaringannya.

alt artemis

Sumber: #

Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing

Kualitas Pelajar Dipengaruhi Sekolah atau Pelajarnya

December 17, 2025

Education

Hasil kualitas pendidikan apakah ditentukan oleh sekolah atau pelajarnya? Di negara maju, kualitas lebih banyak ditentukan oleh kualitas pelajarnya. Di negara berkembang, kualitas sangat dipengaruhi oleh kualitas sekolahnya. Argumennya adalah kualitas pendidikan di negara berkembang umumnya tidak merata, sehingga sangat berpengaruh pada kualitas anak didik. Fenomena ini dikenal sebagai Heyneman-Loxley effect".

Literatur #

Berikut ini paper tentang Heyneyman-Loxley effect: “The role of socioeconomic status and school quality in the Philippines: Revisiting the Heyneman–Loxley effect”

Shannon Autonomous Pentester

December 17, 2025

Artificial Intelligence, Cybersecurity

Shannon adalah perangkat lunak pentester berbasis Artificial Intelligence.

Fitur utama:

Beroperasi secara otonom, tidak perlu manual
Laporan pentester dengan exploit yang dapat direproduksi
Critical OWASP Vulnerability Coverage
Code-Aware Dynamic Testing
Powered by Integrated Security Tools
Parallel Processing for Faster Results

alt Shannon Screen

Sumber: #

Github Shannon

Vulnerable API

December 17, 2025

Artificial Intelligence

Vulnerable API adalah contoh software yang di dalamnya berisi vulnerability OWASP top 10.

alt VAmPI

Sumber: #

Professional Software Developers Don't Vibe, They Control: AI Agent Use for Coding in 2025

December 16, 2025

Artificial Intelligence, Software Engineering

The rise of AI agents is transforming how software can be built. The promise of agents is that developers might write code quicker, delegate multiple tasks to different agents, and even write a full piece of software purely out of natural language. In reality, what roles agents play in professional software development remains in question. This paper investigates how experienced developers use agents in building software, including their motivations, strategies, task suitability, and sentiments. Through field observations (N=13) and qualitative surveys (N=99), we find that while experienced developers value agents as a productivity boost, they retain their agency in software design and implementation out of insistence on fundamental software quality attributes, employing strategies for controlling agent behavior leveraging their expertise. In addition, experienced developers feel overall positive about incorporating agents into software development given their confidence in complementing the agents’ limitations. Our results shed light on the value of software development best practices in effective use of agents, suggest the kinds of tasks for which agents may be suitable, and point towards future opportunities for better agentic interfaces and agentic use guidelines.

Spec Kit

December 13, 2025

Software Engineering

Spec Kit is An open source toolkit that allows you to focus on product scenarios and predictable outcomes instead of vibe coding every piece from scratch.

Spec-Driven Development means specifications become executable, directly generating working implementations rather than just guiding them.

Source: