INDEX
Explanations
phrases related to taking action or being taken
instances of the word "taken" used in various contexts
New Auto-Interp
Negative Logits
lich
-0.70
hart
-0.67
Pupp
-0.65
til
-0.63
ler
-0.63
ihara
-0.63
eers
-0.62
hips
-0.60
nder
-0.60
Cong
-0.60
POSITIVE LOGITS
aback
1.13
Taken
0.96
aways
0.94
Mehran
0.84
OVER
0.81
ardless
0.80
ACTIONS
0.78
ĸļ
0.77
ĭ
0.76
"$:/
0.76
Activations Density 0.022%