INDEX
Explanations
phrases indicating social or political dynamics, particularly involving recognition and accountability
New Auto-Interp
Negative Logits
avern
-0.15
aurus
-0.15
isma
-0.15
æľĢæĸ°
-0.15
agli
-0.15
hrad
-0.14
enda
-0.14
atham
-0.14
INE
-0.14
68
-0.14
POSITIVE LOGITS
serious
0.18
permanent
0.17
ÑĢÑĥн
0.17
major
0.17
sophisticated
0.16
sut
0.16
entire
0.16
뢰
0.16
permanent
0.16
vip
0.15
Activations Density 0.020%