Explanations
mentions of specific entities or topics
the occurrence of a specific placeholder or identifier in the text
Negative Logits
mble     -0.72
Canaver  -0.70
gew      -0.69
Gould    -0.68
vor      -0.68
vim      -0.68
ribly    -0.68
rolet    -0.65
ties     -0.63
athed    -0.62
Positive Logits
responders   1.10
Published    1.01
baseman      0.87
impressions  0.83
lady         0.82
Nations      0.82
Appearance   0.81
ancest       0.81
timers       0.79
Lady         0.75
Activations Density: 0.060%