Dual pathway architecture in songbirds enables robust sensorimotor learning
The acquisition of sensorimotor skills critically depends on basal ganglia (BG)-thalamo-cortical circuits. Prevailing theories propose that the BG optimize motor output through reinforcement learning (RL), using internal performance evaluations to approximate stochastic gradient ascent. However, this framework struggles in non-convex performance landscapes, where local optima hinder efficient learning. Songbirds provide a compelling biological example of robust sensorimotor learning, mastering complex vocalizations through trial-and-error within a specialized BG-thalamo-cortical architecture. Here, we present a computational model constrained by the anatomy, physiology, and developmental tra