INDEX
Explanations
mentions of the drug "rug."
words related to rugs and rug-making
New Auto-Interp
Negative Logits
Gutenberg
-0.66
Petra
-0.64
Sturgeon
-0.63
Dunham
-0.63
Genesis
-0.62
ISO
-0.61
ZI
-0.61
Luthor
-0.60
Dresden
-0.60
hower
-0.59
POSITIVE LOGITS
uay
0.99
rug
0.93
glers
0.93
ulus
0.90
nut
0.88
nuts
0.88
osity
0.87
ular
0.86
atism
0.85
ulent
0.83
Activations Density 0.014%