INDEX
Explanations
references to folk culture and folk art
Negative Logits
ズ              -0.20
s               -0.18
eel             -0.16
ccount          -0.15
points          -0.15
IAL             -0.15
yx              -0.15
ee              -0.15
conciliation    -0.15
ity             -0.15
Positive Logits
lor             0.38
lore            0.31
лор             0.21
swagen          0.19
oric            0.18
wisdom          0.18
/pop            0.17
ore             0.17
vang            0.17
adelphia        0.17
Activations Density 0.007%