Bendersky, Diego Ariel, and Juan Miguel Santos. “Learning from the Environment with a Universal Reinforcement Function.” International Journal of Computing 5, no. 3 (August 1, 2014): 68-74. Accessed May 6, 2024. https://computingonline.net/computing/article/view/410.