Bendersky, Diego Ariel, and Juan Miguel Santos. “LEARNING FROM THE ENVIRONMENT WITH A UNIVERSAL REINFORCEMENT FUNCTION”. International Journal of Computing 5, no. 3 (August 1, 2014): 68-74. Accessed April 3, 2025. https://computingonline.net/computing/article/view/410.