INDEX
Explanations
names of people
proper nouns and names related to people
New Auto-Interp
Negative Logits
youtube
-0.63
Birthday
-0.63
venge
-0.61
Offline
-0.61
Bound
-0.61
renheit
-0.60
Fortress
-0.57
starving
-0.57
Thumbnails
-0.55
IVE
-0.54
POSITIVE LOGITS
added
1.43
said
1.39
explained
1.32
cautioned
1.24
noted
1.22
argued
1.20
pointed
1.17
speculated
1.14
recalled
1.14
conceded
1.12
Activations Density 0.104%