Explanations
references to various biological species
Negative Logits
Token    Logit
imp      -0.69
Earth    -0.68
sup      -0.67
esp      -0.66
supre    -0.65
sla      -0.63
lit      -0.62
bay      -0.62
inf      -0.61
jab      -0.61
Positive Logits
Token              Logit
desmotivaciones    0.98
mijne              0.89
zijne              0.87
miniaturka         0.81
zoude              0.78
imaginación        0.77
finanzas           0.75
dezelve            0.75
stratégique        0.75
مشين               0.74
Activations Density: 0.535%