INDEX
Explanations
mentions of individuals or entities that are held in high esteem or regarded positively
New Auto-Interp
Negative Logits
erial
-0.16
[port
-0.15
heim
-0.15
thumbs
-0.14
izador
-0.14
maduras
-0.14
киÑģл
-0.14
ayscale
-0.13
еÑĤи
-0.13
afi
-0.13
POSITIVE LOGITS
Kurum
0.15
abant
0.14
tern
0.14
uten
0.14
WithValue
0.14
vÄĽ
0.13
rnÄĽ
0.13
oshi
0.13
713
0.13
894
0.13
Activations Density 0.005%