INDEX
Explanations
words related to switches or actions involving switches
occurrences of the word "switch."
New Auto-Interp
Negative Logits
za
-0.69
oun
-0.68
apolis
-0.67
ICAN
-0.65
vez
-0.63
1981
-0.61
naire
-0.60
Mehran
-0.60
vae
-0.59
kamp
-0.59
POSITIVE LOGITS
blade
1.17
grass
1.11
gear
1.05
backs
1.00
aroo
0.87
switch
0.87
switch
0.86
over
0.82
back
0.82
switches
0.80
Activations Density 0.030%