INDEX
Explanations
the name "Gab" at varying activation strengths
repeated mentions or references to the name "Gab."
New Auto-Interp
Negative Logits
tenance
-0.82
IDER
-0.73
PATH
-0.71
OCK
-0.71
ctive
-0.68
chnology
-0.67
UME
-0.66
IGHTS
-0.65
soDeliveryDate
-0.63
HEAD
-0.62
POSITIVE LOGITS
Gab
1.06
riel
1.03
onis
0.92
ran
0.92
raham
0.85
ilib
0.84
lock
0.83
rod
0.82
oise
0.80
rets
0.80
Activations Density 0.007%