Demo:

Fault tolerance for Internet agent systems: in case of stop failure and Byzantine failure
Tadashi Araragi


In this demonstration, we show our two practical fault-tolerant systems for stop failure and Byzantine failure that work under enormous and dynamic distributed systems such as Internet agent systems. The fault-tolerant systems are implemented on both JADE and our FIPA compliant agent execution platform (Erdoes). In the demonstration, we introduce ten laptop PCs connected with a 10-Mbps LAN, where eight agents are
running on each PC. On this system we show real faults and recoveries. For stop failure, we shut down any chosen PC at any time, and show that the agents running on it are consistently all recovered on stable machines. The audience can witness the consistency by checking graphically represented application messages exchanged among them. For Byzantine failure, we control any chosen agent on any PC, and show this controlled agent does not have any effect on the behavior of honest agents in any way. Eighty agents are executing sample application programs on the systems, and our methods are shown not to seriously slow down the execution. Technically, our fault-tolerance methods can overcome emerging new characteristics of distributed systems such as Internet agent systems: extremely large size, dynamism, and homogeneity (non-structure).