INDEX
Explanations
substrings following a pattern of slashes and names with varying activation values
occurrences of the "REUTERS/" marker indicating news sources
New Auto-Interp
Negative Logits
prol
-0.77
worms
-0.73
beetles
-0.72
worms
-0.72
idol
-0.70
verbs
-0.69
infertility
-0.68
sooner
-0.67
ĪĴ
-0.65
snakes
-0.65
POSITIVE LOGITS
File
0.99
AFP
0.94
Getty
0.93
David
0.93
Mike
0.90
Jim
0.89
Flickr
0.88
Rick
0.87
Brend
0.87
Jonathan
0.87
Activations Density 0.026%