experience.log

reputation

пятница, 21 июня 2013 г. Posted by Unknown

Tag: RL

Tran, T., Cohen, R. (2002) A Reputation-Oriented RL Strategy for Agents in Electronic Marketplaces (proceedings, article)

Tag: Grid

Saravana Kumar, E., Sumathi, A. (2013) A Trust Based Ant Colony Optimized Grid Scheduling. Life Science Journal 2013,10(7s), pp.466-470 (article)
Yao Wang, Julita Vassileva (2007) Toward Trust and Reputation Based Web Service Selection: A Survey. In International Transactions on Systems Science and Applications (ITSSA) Journal, special Issue on New tendencies on Web Services and Multi-agent Systems (WS-MAS), 3(2) (article)
Kurian Alunkal, B., Veljkovic I., von Laszewski G., Amin, K. (2003) Reputation-Based Grid Resource Selection. In Workshop on Adaptive Grid Middleware, P.28 (article)
Zetuny, Yonatan and Terstyanszky, Gabor and Winter, Stephen and Kacsuk, Peter K. (2009) Adapted quality resource selection using the Grid reputation-policy trust management service. In: Weghorn, Hans and Roth, Jörg and Isaías, Pedro, (eds.) Proceedings of the IADIS International Conference Informatics 2009: part of the IADIS Multi Conference on Computer Science and Information Systems 2009, Algarve, Portugal, 17 - 23 June 2009. International Association for Development of the Information Society, pp. 11-18. ISBN 9789728924867 (article)
Du Ruizhong, Ma Xiaoxue, Wang Zixian, Zhang Fang (2010) A Grid Reputation Model Based on Service Quality. Proceedings of the 2010 Second International Conference on Networks Security, Wireless Communications and Trusted Computing, 01, pp.206-209 (reference)

grid

четверг, 11 апреля 2013 г. Posted by Unknown

Литвинов, В. В., Стеценко, І.В. (2012) Управління розподіленими ресурсами грід-системи. Математичні машини і системи, (2). (стаття - сети Петри)
Kanzariya D., Patel S. (2013) Survey on Resource Allocation in Grid. International Journal of Engineering and Innovative Technology (IJEIT), 8(2), pp. 129-133. (article)

Germain-Renaud, C., Perez, J. (2008) Grid Differentiated Services: a RL Approach. (article)
Jun Wu, Xin Xu, Pengcheng Zhang, Chunming Liu (2011) A novel multi-agent reinforcement learning approach for job scheduling in Grid computing. Future Generation Computer Systems, 27(5), pp. 430–439. (article, link)
Zhengli Zhai (2010) Grid resource selection based on reinforcement learning. Computer Application and System Modeling (ICCASM), 2010 International Conference on, 12, pp.644-647. (article)
Julien Perez C ecile Germain-Renaud
Balazs K egl Charles Loomis, Multi-objective Reinforcement Learning for Responsive
Grids (article)
http://content.yudu.com/Library/A20y4a/AgentbasedTaskSchedu/resources/index.htm

Online LSPI

Posted by Unknown

Li, L., Littman, M., L.& Mansley, C., R.(2009) Online Exploration in LSPI (slides,article,techreport)
Busoniu, L., Ernst, D., De Schutter, B., & Babuˇska, R.(2010) Online LSPI for RL control. In proceeding of: American Control Conference (ACC), 2010. (proceedings)
Busoniu, L., De Schutter, B., Babuˇska, R., & Ernst, D. (2010) Using prior knowledge to accelerate online LSPI (article?)
Bu¸soniu, L., De Schutter, B., Babuˇska, R., & Ernst, D. (2010) Exploiting policy
knowledge in online LSPI: An empirical study. Automation, Computers, Applied Mathematics, 19(4), pp. 521–529. (techreport)

LSPI

среда, 10 апреля 2013 г. Posted by Unknown

1. Fern, A., Batch RL Via LSPI (slides)
2. Elkan, C. (2012) Least squares policy iteration (LSPI) (article)
3. Busoniu, L., Lazaric, A., Ghavamzadeh, M., Munos, R., Babuska, R., and De Schutter, B. (2011) Least-squares methods for policy iteration. Reinforcement Learning: State of the Art. Springer. (article)
4. Lagoudakis, M., G., Parr, R., (2003) LSPI (article, code)
5. Lagoudakis, M., G., Parr, R., (2001) Model-Free LSPI (proceeding)

Lazaric, A., Ghavamzadeh, M. & Munos, R. (2011) Finite-sample analysis of least-squares policy iteration. Journal of Machine learning Research, 13:3041-3074.
Xu, X., Hu D., & Lu, X. (2007) Kernel-Based LSPI for RL (article)
Ma, J., & Powell, W. B. (2009) Convergence Proofs of LSPI Algorithm for High-Dimensional Infinite Horizon Markov Decision Process Problems (article)
Thiery, C., & Scherrer, B. (2010) Least-Squares lambda Policy Iteration: Bias-Variance Trade-off in Control Problems (article)

Фреймворки для RL

Posted by Unknown

Lucian Busoniu: MARL toolbox; Approximate RL and DP toolbox

Reinforcement Learning Toolbox 2.0 (last updated: 07. Nov 2006)

RLAI; JRLF;

Programmable RL Agents (article)

The Teaching-Box: A Universal Robot Learning Framework (article)

De Comité, F. (2005) A Java Platform for RL Experiments (article)

Andre, D., & Russell, S. (2000) Programmable RL Agents (article)

Полезные штуковины

вторник, 26 марта 2013 г. Posted by Unknown

Каталог научных ресурсов

http://www.scintific.narod.ru/
Содержит перечень научных поисковых систем, ссылки на книгосодержащие сайты (разбиты по наукам), ресурсы по темам: нейросети, численные методы, нелинейная динамика.

Онлайн-доска
http://realtimeboard.com/

Сервис, позволяющий творить свою доску. Поддерживает работу с гуглодиском. Есть возможность строить диаграммы, схемы, рисовать что-либо "от руки", добавлять на доску документы и шарить это с остальными.

Some advices

пятница, 22 марта 2013 г. Posted by Unknown

List of advices for PhD students.
Reasons of starting science blog.
Some facts about doing PhD.
Useful advices about scientific writing for non-native English speakers.