INDEX
Negative Logits
predis
-0.09
-th
-0.08
intensive
-0.07
eş
-0.07
Industrial
-0.07
besl
-0.07
deficit
-0.07
-er
-0.07
casual
-0.07
Larson
-0.07
POSITIVE LOGITS
exile
0.12
disgrace
0.10
fugit
0.10
roam
0.09
prince
0.08
AGO
0.08
khỏi
0.08
flee
0.08
xcc
0.08
permanente
0.08
Activations Density 0.006%