INDEX
Explanations
words related to investigation or enhancement
words related to intense emotional states or experiences
New Auto-Interp
Negative Logits
Kush
-0.67
aisle
-0.64
uneven
-0.63
liners
-0.60
Cinema
-0.60
driveway
-0.58
runway
-0.58
spotting
-0.58
air
-0.58
Circus
-0.56
POSITIVE LOGITS
ment
0.90
¶æ
0.88
orial
0.81
vous
0.80
ments
0.78
enced
0.78
uel
0.78
orate
0.77
ement
0.77
abil
0.77
Activations Density 0.191%