INDEX
Explanations
numerals followed by hyphens, usually indicating a range
quantifiable statistics or data presented in a numerical format
New Auto-Interp
Negative Logits
turbulence
-0.74
Cassidy
-0.72
scenery
-0.65
NV
-0.63
Brav
-0.62
Burr
-0.62
Uni
-0.60
Osw
-0.59
transc
-0.58
popcorn
-0.58
POSITIVE LOGITS
five
1.88
eight
1.84
seven
1.83
six
1.80
nine
1.79
four
1.72
three
1.71
two
1.64
Eight
1.37
fifth
1.34
Activations Density 0.023%