INDEX
Explanations
dialogue and interactions among characters, particularly in legal contexts
Follows questions or replies
asking and answering questions
New Auto-Interp
Negative Logits
CppMethod
-0.61
мәкал
-0.59
مشارکتکنندگان
-0.57
Heer
-0.52
][]
-0.51
Tikang
-0.50
Kerk
-0.49
toolStripButton
-0.49
Twee
-0.49
اعد
-0.48
POSITIVE LOGITS
replied
0.79
reply
0.70
Answer
0.68
Darauf
0.67
reply
0.66
answer
0.65
replying
0.65
Antwort
0.63
respondeu
0.63
躇
0.61
Activations Density 0.209%