INDEX
Explanations
instances of numbers and identifying characteristics
information about historical or factual events
New Auto-Interp
Negative Logits
another
-0.69
Another
-0.66
elsewhere
-0.62
Another
-0.60
Nope
-0.59
instead
-0.59
Otherwise
-0.56
..........
-0.55
sans
-0.54
less
-0.54
POSITIVE LOGITS
culminating
0.61
culminated
0.58
influencing
0.55
rued
0.55
ULTS
0.55
sequently
0.52
contributors
0.51
interpre
0.51
//[
0.51
genesis
0.50
Activations Density 0.887%