INDEX
Explanations
phrases related to taking action or creating or manipulating something
instances of the word "make" in various contexts
New Auto-Interp
Negative Logits
æ©Ł
-0.78
anders
-0.68
stration
-0.64
ander
-0.64
rants
-0.63
moil
-0.62
Below
-0.62
thia
-0.62
hood
-0.61
propensity
-0.60
POSITIVE LOGITS
sure
1.66
sense
0.99
ends
0.95
Sure
0.91
sure
0.90
matters
0.89
Sense
0.87
room
0.79
adjustments
0.78
things
0.78
Activations Density 0.099%