INDEX
Explanations
sentences expressing personal opinions or doubts
Negative Logits
aarrggbb     -0.70
vitae        -0.67
Zufall       -0.61
meni         -0.60
PageContext  -0.59
Yii          -0.59
&&           -0.58
erati        -0.58
Axiom        -0.58
avía         -0.58
Positive Logits
Heres   1.19
Theres  1.15
thats   1.14
theres  1.13
dont    1.10
Theres  1.09
wasnt   1.08
Dont    1.06
youre   1.06
isnt    1.05
Activations Density 0.091%