INDEX
Explanations
reported speech and statements made by individuals
New Auto-Interp
Negative Logits
according
-0.16
unger
-0.15
Brock
-0.15
Ģ
-0.15
thunder
-0.15
etc
-0.15
ony
-0.15
too
-0.15
ira
-0.14
olf
-0.14
POSITIVE LOGITS
explan
0.18
.Guna
0.16
è¿Ľä¸ĢæŃ¥
0.16
weiter
0.15
further
0.15
laughing
0.15
/Dk
0.14
isman
0.14
umptech
0.14
OrElse
0.14
Activations Density 0.049%