INDEX
Explanations
actions or suggestions that involve "taking" in various contexts
New Auto-Interp
Negative Logits
務省
-0.71
Miser
-0.68
Horner
-0.65
partiet
-0.65
Miser
-0.64
landet
-0.63
errno
-0.63
irot
-0.63
ynthia
-0.63
Granger
-0.63
POSITIVE LOGITS
Taking
1.57
Taking
1.49
take
1.49
taking
1.43
take
1.41
Take
1.40
taken
1.40
taking
1.40
took
1.37
taken
1.33
Activations Density 0.165%