INDEX
Explanations
connections and relationships expressed through conjunctions
New Auto-Interp
Negative Logits
BOVE
-0.17
lut
-0.16
rcode
-0.14
ÏĦικα
-0.14
Sexe
-0.14
gia
-0.14
dorf
-0.14
plode
-0.14
lrt
-0.14
lom
-0.14
POSITIVE LOGITS
aul
0.18
erson
0.16
transparent
0.14
fx
0.14
FX
0.14
ought
0.14
oust
0.14
aine
0.14
ough
0.14
iles
0.14
Activations Density 0.115%