INDEX
Explanations
mentions of the term "bot" in the text
mentions of robots or automated entities
New Auto-Interp
Negative Logits
journal
-0.76
mble
-0.75
uncture
-0.74
egal
-0.68
eway
-0.62
ournal
-0.59
VEN
-0.58
ACTION
-0.58
simultane
-0.57
Koen
-0.57
POSITIVE LOGITS
anical
1.16
cham
0.99
anooga
0.97
ãĤ´ãĥ³
0.89
zeb
0.89
leneck
0.88
ania
0.88
herer
0.88
assium
0.87
wana
0.86
Activations Density 0.008%