Explanations
phrases related to accomplishment or achievement
instances of the word "gets."
Negative Logits
contrary        -0.64
amen            -0.61
aret            -0.59
approximately   -0.58
ciples          -0.56
numerous        -0.56
large           -0.56
retched         -0.56
purpose         -0.56
contrasting     -0.55
Positive Logits
gets            2.95
Gets            2.22
receives        2.00
loses           1.84
becomes         1.80
earns           1.79
goes            1.78
learns          1.70
survives        1.65
arrives         1.62
Activations Density: 0.029%