INDEX
Explanations
names of people or places
proper nouns, particularly names
New Auto-Interp
Negative Logits
iencies
-0.65
arial
-0.62
itaire
-0.61
drawn
-0.61
geries
-0.60
duplication
-0.59
parachute
-0.59
antine
-0.58
owship
-0.57
draft
-0.57
POSITIVE LOGITS
replied
1.28
âĢ
1.25
(@
1.19
told
1.15
explained
1.13
said
1.12
exclaimed
1.09
tweeted
1.09
joked
1.07
remarked
1.07
Activations Density 0.201%