INDEX
Explanations
references to specific names or titles
references to specific entities, primarily names associated with media and veterinary contexts
New Auto-Interp
Negative Logits
ogly
-0.69
earance
-0.69
quartered
-0.69
eworld
-0.67
brook
-0.66
thritis
-0.66
athering
-0.64
athered
-0.64
oug
-0.63
gotten
-0.63
POSITIVE LOGITS
Diesel
0.76
ideos
0.74
III
0.74
iolet
0.74
Machina
0.73
icious
0.72
alkyrie
0.72
Yanukovych
0.71
rine
0.71
istors
0.70
Activations Density 0.064%