INDEX
Explanations
interrogative sentences starting with "What" or "I don't know what" that express uncertainty or confusion
New Auto-Interp
Negative Logits
ulic
-0.78
inence
-0.74
eln
-0.74
heter
-0.74
erate
-0.73
hari
-0.71
robe
-0.71
gur
-0.69
emp
-0.69
enberg
-0.67
POSITIVE LOGITS
happens
1.29
happened
1.28
transpired
1.15
kinds
1.12
else
1.11
constitutes
1.11
happ
1.08
exactly
1.03
soever
0.99
kind
0.97
Activations Density 0.352%