Source Paper Year Line Sentence
W01-1629 2001 17
Several efficient dialogue control methods have been proposed (Niimi and Kobayashi, 1996; Litman et al, 2000)
N01-1028 2001 17
Reinforcement learning has been used in several recent approaches to search for the optimal dialogue management strategy for specific dialogue situations (Levin and Pieraccini, 1997; Litman et al, 2000; Singh et al, 2000; Walker, 2000)
N01-1028 2001 101
In order to study this, we applied foidl to the optimal strategy presented in (Litman et al, 2000), which "presents a large-scale application of RL [reinforcement learning] to the problem of optimizing dialogue strategy selection [...]"
N01-1028 2001 110
Preconditions on state
greetings  info.#  confidence  value  question #  open/closed  history  Action
B          1       0           1      F           G            H        expconf(1)
B          1       1           E      F           G            H        expconf(1)
B          1       2           E      F           G            H        noconf(1)
B          1       4           E      0           G            H        reaskm(1)
B          1       D           E      1           G            H        reasks(1)
B          2       0           1      F           G            H        expconf(2)
1          2       2           1      0           0            0        noconf(2)
B          2       0           E      F           1            1        noconf(2)
B          2       2           E      F           1            H        noconf(2)
B          2       D           0      1           G            H        reaskm(2)
B          3       D           1      F           G            1        expconf(3)
B          3       D           1      F           0            H        expconf(3)
1          3       1           1      0           1            0        noconf(3)
1          3       1           1      0           0            1        noconf(3)
1          3       2           1      0           0            1        noconf(3)
B          3       0           E      F           G            0        noconf(3)
B          3       0           E      F           0            H        noconf(3)
1          C       0           0      0           G            H        asku(C) [2]
B          C       4           E      0           G            H        reasks(C) [3]
B          C       4           0      F           G            1        reaskm(C) [2]
B          C       2           E      F           0            1        expconf(C) [2]
B          C       1           1      F           0            H        expconf(C) [5]
B          C       2           E      F           1            0        noconf(C) [3]
0          C       D           E      F           G            H        greetu [1]
Table 3: NJFun optimal rules
See Litman et al (2000) for a more detailed explanation of the state representation
E06-2009 2006 77
This makes it usable for whole dialogue strategies, but also, if desired, it can be targeted only on specific dialogue management decisions (e.g. implicit vs. explicit confirmation, as was done by (Litman et al, 2000))
P09-1010 2009 32
In contrast, our emphasis is on learning language by proactively interacting with an external environment. Reinforcement Learning for Language Processing: Reinforcement learning has been previously applied to the problem of dialogue management (Scheffler and Young, 2002; Roy et al, 2000; Litman et al, 2000; Singh et al, 1999)