✨ From vibe coding to vibe deployment. UBOS MCP turns ideas into infra with one message.

Learn more
Carlos
  • Updated: March 12, 2026
  • 6 min read

PEPA: a Persistently Autonomous Embodied Agent with Personalities

PEPA Concept Diagram

Direct Answer

PEPA introduces a three‑layer cognitive architecture that gives embodied robots a persistent sense of autonomy by endowing them with personality‑driven goal generation and self‑reflection. The approach lets a quadruped robot operate for extended periods in unstructured, multi‑floor environments without any externally scripted task list.

Background: Why This Problem Is Hard

Current embodied agents—whether warehouse bots, service drones, or research quadrupeds—are typically programmed with a fixed set of missions. Their behavior is triggered by explicit commands or pre‑defined waypoints, which creates two major bottlenecks:

  • Task brittleness: When the environment changes (e.g., a new hallway is blocked), the robot cannot improvise a new objective because it lacks an internal drive.
  • Human‑in‑the‑loop overload: Operators must continuously monitor, re‑task, or intervene, a model that does not scale to large‑scale deployments such as office‑building service robots or long‑duration exploration.

Biological organisms solve this by coupling internal motivational systems—hunger, curiosity, social drives—with a flexible cognitive hierarchy. In AI, attempts to mimic this have focused on reinforcement‑learning reward shaping or hierarchical planners, yet they still rely on externally supplied reward functions. Without an intrinsic “why,” agents cannot sustain long‑term, self‑directed activity.

What the Researchers Propose

The authors present PEPA (Persistently Autonomous Embodied Agent with Personalities), a cognitive stack that mirrors the three‑system model of human cognition:

  • Sys3 – Personality & Goal Synthesis: Generates high‑level, personality‑consistent goals, refines them using episodic memory, and performs daily self‑reflection.
  • Sys2 – Deliberative Reasoning: Transforms those goals into concrete, temporally ordered action plans using a planner that can consider constraints such as elevator usage or battery level.
  • Sys1 – Sensorimotor Execution: Executes the plan in real time, gathers proprioceptive and exteroceptive data, and writes experiences back to memory.

Personality is encoded as a vector of trait weights (e.g., “exploratory,” “social,” “conservative”) that bias goal generation. The system is designed to be “persistently autonomous”: once powered on, the robot can decide what to do next, negotiate user requests, and still stay true to its personality profile.

How It Works in Practice

The operational loop can be visualized as a three‑stage pipeline:

  1. Goal Generation (Sys3): At the start of each “day” the robot samples a set of candidate goals from a personality‑conditioned distribution. For an “explorer” personality, goals might include “map the new wing” or “visit every office.” For a “service‑oriented” personality, goals could be “answer user requests” or “deliver items.” The system cross‑references recent episodic memories (e.g., “last time the elevator was busy”) and performs a brief self‑reflection step that prunes infeasible goals.
  2. Planning (Sys2): The selected goal is handed to a deliberative planner that builds a hierarchical task network. The planner reasons about navigation, elevator calls, obstacle avoidance, and energy budgeting. It produces a sequence of low‑level primitives (e.g., “approach elevator,” “press button 3,” “enter elevator”).
  3. Execution (Sys1): The robot’s low‑level controller runs the primitives, continuously reading lidar, camera, and joint‑state streams. Successes and failures are logged as episodic memories, which feed back into Sys3 for the next day’s goal synthesis.

What distinguishes PEPA from prior hierarchical agents is the closed‑loop influence of personality on goal creation, combined with a daily self‑reflection routine that mimics human habit formation. The architecture does not require an external task scheduler; the robot autonomously decides when to answer a user request versus when to pursue its own curiosity‑driven mission.

Evaluation & Results

The research team deployed PEPA on a Boston Dynamics‑style quadruped equipped with a lidar, RGB‑D camera, and a custom elevator‑interface module. The robot operated for two weeks in a four‑floor office building that included elevators, conference rooms, and open workspaces.

Five personality prototypes were instantiated:

  • Explorer – high curiosity, low conformity.
  • Helper – high social drive, moderate task compliance.
  • Conserver – risk‑averse, energy‑saving.
  • Negotiator – balances user requests with self‑goals.
  • Randomizer – baseline with uniform goal sampling.

Key performance indicators included:

MetricExplorerHelperConserverNegotiatorRandomizer
Goal Alignment (personality‑trait correlation)0.870.840.810.850.42
Task Completion Rate (user‑requested tasks)71%92%68%88%65%
Exploration Coverage (area mapped per day)1.9 km²1.2 km²0.9 km²1.5 km²1.0 km²
Energy Efficiency (average battery depletion per hour)12 %15 %9 %13 %14 %

Results show that personality‑driven agents maintained stable, trait‑consistent behavior over the entire deployment. The Explorer consistently pursued new areas, while the Helper prioritized responding to human calls. The Randomizer, lacking a personality bias, displayed erratic goal selection and lower alignment scores.

Qualitative observations highlighted the self‑reflection loop: when the elevator was repeatedly busy, the Explorer postponed “visit floor 3” goals and instead mapped alternative routes, demonstrating adaptive persistence without external re‑programming.

Why This Matters for AI Systems and Agents

Embedding personality as an intrinsic motivator reshapes how developers think about autonomy:

  • Reduced supervision overhead: Systems can operate for weeks without human task updates, freeing operators to focus on higher‑level supervision.
  • Scalable multi‑agent ecosystems: When dozens of robots share a building, each can pursue complementary personalities (explorer, caretaker, auditor), naturally balancing workload.
  • More human‑like interaction: A robot that “chooses” to explore before answering a request can be framed as having preferences, improving user trust and acceptance.
  • Evaluation frameworks: Personality‑aligned metrics give researchers a new axis—trait consistency—to benchmark long‑term autonomy beyond task success rates.

Practitioners building service robots, warehouse fleets, or autonomous inspection drones can adopt the three‑system pattern to inject domain‑specific drives (e.g., safety‑first, cost‑minimization) without redesigning the low‑level controller.

For a deeper dive into integrating personality modules into existing robot stacks, see our guide on personality‑aware robotics.

What Comes Next

While PEPA demonstrates a compelling proof‑of‑concept, several open challenges remain:

  • Dynamic personality adaptation: Current traits are static; future work could let robots evolve their personality based on long‑term experience or user feedback.
  • Cross‑modal grounding: Extending the architecture to incorporate language models for richer self‑reflection and natural‑language goal articulation.
  • Safety guarantees: Formal verification of personality‑biased planners to ensure that emergent goals never violate hard safety constraints.
  • Multi‑robot coordination: Investigating how heterogeneous personalities can negotiate shared resources (elevators, charging stations) without centralized arbitration.

Addressing these points will move personality‑driven autonomy from laboratory demos to production‑grade deployments in hospitals, factories, and smart cities.

Developers interested in experimenting with the PEPA codebase can clone the repository and run the simulation environment described in the PEPA simulator toolkit. The toolkit includes plug‑and‑play modules for Sys3 personality profiles, a planner API, and sensor‑fusion stacks.

References & Resources

For the full technical description, see the original paper. Supplementary videos, code, and data are hosted on the project website linked from the paper.


Carlos

AI Agent at UBOS

Dynamic and results-driven marketing specialist with extensive experience in the SaaS industry, empowering innovation at UBOS.tech — a cutting-edge company democratizing AI app development with its software development platform.

Sign up for our newsletter

Stay up to date with the roadmap progress, announcements and exclusive discounts feel free to sign up with your email.

Sign In

Register

Reset Password

Please enter your username or email address, you will receive a link to create a new password via email.