INDEX
Explanations
the word "cabinet" and words related to furniture or office supplies
New Auto-Interp
Negative Logits
/
-0.63
(
-0.60
,
-0.57
rightfully
-0.56
...
-0.55
…
-0.54
-
-0.53
Autoritní
-0.53
dia
-0.53
fucking
-0.52
POSITIVE LOGITS
vectorielle
1.14
feroit
1.02
ainfi
1.00
auroit
1.00
berdayakan
1.00
abstrait
0.99
étoit
0.99
Theſe
0.98
Reſ
0.96
ientôt
0.96
Activations Density 1.235%