INDEX
Explanations
proper nouns related to people and places
references to specific people, places, and recognized achievements
New Auto-Interp
Negative Logits
Tokens
-0.56
cohesion
-0.52
IPM
-0.52
conventions
-0.51
inputs
-0.51
dissatisf
-0.51
Zot
-0.50
shenan
-0.50
iPads
-0.49
VIEW
-0.49
POSITIVE LOGITS
agonist
0.75
utenant
0.71
appointed
0.66
assador
0.66
atcher
0.65
dancer
0.63
inmate
0.63
lover
0.62
apper
0.62
trained
0.61
Activations Density 0.683%