Explanations
statements of personal experiences and opinions
Negative Logits
seemingly   -0.23
nicht       -0.23
seems       -0.22
seem        -0.22
seemed      -0.21
không       -0.20
not         -0.20
नह          -0.19
doesn       -0.19
Seems       -0.19
Positive Logits
fairly      0.18
overall     0.17
fair        0.17
overall     0.17
fair        0.17
maybe       0.16
mostly      0.16
mostly      0.15
slightly    0.15
ораль       0.15
Activations Density 0.238%