INDEX
Explanations
phrases indicating choices and decisions
New Auto-Interp
Negative Logits
inge
-0.16
ahas
-0.15
endra
-0.15
trie
-0.15
stag
-0.15
ader
-0.15
adera
-0.14
dst
-0.14
dre
-0.14
ivil
-0.14
POSITIVE LOGITS
CurrentValue
0.15
ügen
0.15
avicon
0.14
quam
0.14
Pap
0.14
146
0.14
Guinea
0.14
ĻĤ
0.14
816
0.13
getField
0.13
Activations Density 0.015%