INDEX
Explanations
references to societal issues and trends
New Auto-Interp
Negative Logits
oice
-0.18
ayah
-0.17
lich
-0.15
acman
-0.15
anke
-0.14
lico
-0.14
Clover
-0.14
isper
-0.14
ertz
-0.14
refresh
-0.14
POSITIVE LOGITS
apat
0.19
increasingly
0.17
iad
0.14
ViewById
0.14
رب
0.14
catid
0.14
dzi
0.14
ORIZATION
0.14
apo
0.13
ä½IJ
0.13
Activations Density 0.506%