INDEX
Explanations
references to a particular individual or cause, likely related to activism or social justice
New Auto-Interp
Negative Logits
çīĪ
-1.11
hack
-0.98
raid
-0.97
venture
-0.96
cript
-0.95
eri
-0.94
æĸ¹
-0.90
elled
-0.90
ENG
-0.88
puff
-0.88
POSITIVE LOGITS
adulthood
0.90
AFTER
0.88
atcher
0.88
iversal
0.86
shore
0.85
terday
0.84
unda
0.84
proven
0.84
they
0.84
someone
0.83
Activations Density 0.293%