INDEX
Explanations
phrases indicating direct speech or quotes
statements or phrases that contain specific legal or formal phrases
New Auto-Interp
Negative Logits
tremend
-0.74
assemb
-0.60
levers
-0.60
mans
-0.60
Bronze
-0.58
dispers
-0.58
scatter
-0.58
($)
-0.58
snail
-0.58
nown
-0.57
POSITIVE LOGITS
ľ
1.30
ª
1.22
¬
1.17
¡
1.16
º
1.09
¿
1.08
¦
1.04
Ń
1.04
¤
1.04
Ĵ
1.03
Activations Density 0.198%