INDEX
Explanations
phrases related to top rankings or superlatives
phrases emphasizing rankings or superlative descriptions
Negative Logits
ADRA         -0.80
ulhu         -0.75
Noir         -0.74
Debor        -0.73
Dickinson    -0.71
Richards     -0.70
Borders      -0.70
Indigo       -0.70
Cox          -0.69
Lange        -0.68
Positive Logits
sized            1.27
sounding         1.25
generation       1.16
looking          1.15
ever             1.15
exclusive        1.15
ranked           1.12
advertisement    1.08
performing       1.07
ranking          1.05
Activations Density: 0.082%