INDEX
Explanations
mentions of positions or rankings
colons and numerical data or statistics related to rankings
New Auto-Interp
Negative Logits
bered
-0.76
tremend
-0.75
omething
-0.74
inement
-0.72
vre
-0.70
uggest
-0.70
uve
-0.69
izons
-0.69
schild
-0.67
alist
-0.65
POSITIVE LOGITS
Provided
0.96
???
0.93
Unknown
0.92
TBD
0.91
Bye
0.85
TBA
0.83
None
0.79
Same
0.78
Miscellaneous
0.76
Nope
0.75
Activations Density 0.085%