INDEX
Explanations
instances of figurative language and tones indicating ambiguity
New Auto-Interp
Negative Logits
ëĨĢ
-0.14
ihil
-0.13
atomy
-0.13
goto
-0.13
hurst
-0.13
rin
-0.13
angelog
-0.12
arine
-0.12
anta
-0.12
Pow
-0.12
POSITIVE LOGITS
literal
1.05
literally
1.00
Liter
0.94
liter
0.89
literal
0.88
Literal
0.84
Liter
0.77
Literal
0.75
liter
0.70
-liter
0.70
Activations Density 0.127%