INDEX
Explanations
references to cultural significance and successful transformations
New Auto-Interp
Negative Logits
noop
-0.19
سابÙĤ
-0.16
xdb
-0.15
older
-0.15
previous
-0.15
imes
-0.14
eski
-0.14
æĹ§
-0.14
former
-0.14
ancient
-0.14
POSITIVE LOGITS
full
0.27
household
0.26
mini
0.24
bona
0.23
overnight
0.23
Household
0.20
permanent
0.19
fixture
0.19
entity
0.19
fully
0.19
Activations Density 0.192%