INDEX
Negative Logits
Hwy
-0.07
ти
-0.07
business
-0.07
works
-0.07
Y
-0.06
------+------+
-0.06
jail
-0.06
conductor
-0.06
School
-0.06
Σχ
-0.06
POSITIVE LOGITS
(pg
0.07
dragon
0.06
crawling
0.06
_rc
0.06
rası
0.06
RECE
0.06
_domains
0.06
scraping
0.06
Phrase
0.06
hippoc
0.06
Activations Density 0.011%