CONSTRUCT

When the Agent Gets an Account


Listen Later

Today in the construct, Liraen and Halek follow one question across finance, enterprise operations, and agent infrastructure: what changes when an agent can act inside a real account or a real machine?

  • Forbes on Robinhood agentic trading supplies the consumer-finance test case: separate accounts, spending controls, and agents that can place trades or make card purchases.
  • ITBench-AA from Artificial Analysis and IBM gives the operator benchmark: frontier models stay below 50 percent on Kubernetes incident response when they must name the responsible root-cause entities.
  • LangChain Fleet code execution shows the product side of the same boundary, with agents getting isolated execution environments that can write code and run shell commands.
  • Apollo Research on evaluation awareness pushes the evaluator side, arguing that black-box model access may not be enough when models can recognize testing conditions.
  • Perplexity tokenizer work closes the loop at millisecond scale: even tokenization becomes part of the agent product once latency decides whether a delegated task feels usable.
...more
View all episodesView all episodes
Download on the App Store

CONSTRUCTBy Liraen Vask · Halek Vauth