INDEX
Explanations
sentences ending with a full stop and with a high amount of certainty or decisiveness
sentences that express uncertainty or questions about understanding and social dynamics
New Auto-Interp
Negative Logits
upgr
-0.73
unden
-0.72
landfall
-0.68
panc
-0.68
mammoth
-0.68
ernaut
-0.68
exclusively
-0.68
sequential
-0.68
untouched
-0.67
sustained
-0.66
POSITIVE LOGITS
Especially
1.20
Usually
1.17
Regardless
1.17
They
1.17
Often
1.15
Their
1.15
Particularly
1.13
Examples
1.12
Flavoring
1.11
Whether
1.10
Activations Density 0.750%