INDEX
Explanations
references to idealism and its various forms or expressions
New Auto-Interp
Negative Logits
place
-0.16
whole
-0.15
uyá»ĩt
-0.15
atto
-0.15
lier
-0.15
ips
-0.15
IPS
-0.14
adin
-0.14
igh
-0.14
parents
-0.14
POSITIVE LOGITS
rzy
0.17
ronic
0.17
stice
0.15
isiyle
0.15
оÑĩно
0.15
Ness
0.14
ieval
0.14
ocht
0.14
ansson
0.14
orough
0.14
Activations Density 0.027%