INDEX
Negative Logits
ability
0.41
bugs
0.38
isieren
0.38
ación
0.38
顕
0.37
asist
0.37
aciju
0.37
fahrer
0.37
ativeness
0.36
fulness
0.36
POSITIVE LOGITS
factor
0.82
factors
0.76
因素
0.75
effect
0.72
facteurs
0.67
influence
0.66
Factor
0.66
factor
0.65
Factors
0.65
faktor
0.64
Activations Density 0.319%