INDEX
Explanations
the word "gar" with varying levels of activation
the word "gar" in various contexts
New Auto-Interp
Negative Logits
ivity
-0.74
anwhile
-0.74
vironment
-0.72
psey
-0.71
gdala
-0.70
reckoning
-0.70
orer
-0.70
terday
-0.69
bargaining
-0.69
constitu
-0.68
POSITIVE LOGITS
gar
1.01
rets
0.90
bage
0.89
rier
0.86
neau
0.84
zik
0.81
rics
0.79
lean
0.79
rett
0.78
rius
0.78
Activations Density 0.005%