Explanations
phrases related to actions involving changing, toggling, or shifting between different options or states
terms related to the concept of "switching" or changing states
Negative Logits
za        -0.77
apolis    -0.65
ICAN      -0.63
icist     -0.63
zza       -0.63
ALE       -0.63
ORED      -0.62
raham     -0.62
Chicken   -0.61
ORE       -0.61
Positive Logits
grass      0.97
switch     0.95
blade      0.94
switch     0.89
switches   0.85
aroo       0.84
backs      0.84
gear       0.84
switched   0.78
switching  0.78
Activations Density: 0.020%