INDEX
Explanations
mentions of notable names and locations in news articles or reports
proper nouns or specific names
Negative Logits
ahime      -0.70
atown      -0.70
watch      -0.68
mand       -0.68
Reviewer   -0.67
trak       -0.67
onite      -0.65
ovie       -0.64
uthor      -0.64
mble       -0.63
Positive Logits
Cox          0.80
Vaugh        0.71
Plaza        0.69
Bridgewater  0.69
Islands      0.69
Meadows      0.68
Island       0.67
Chapman      0.66
Flame        0.65
Mansion      0.64
Activations Density 0.186%
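
The density figure above can be read as the share of sampled token positions on which the feature is active. The page does not state its exact definition, so the following is only a minimal sketch under the assumption that "active" means an activation strictly greater than zero. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and serve illustration only.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: a position counts as "active" when its activation is strictly
    greater than `threshold` (0.0 by default). The dashboard may use a
    different convention.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 1,000 token positions, the feature fires on 2 of them.
acts = [0.0] * 998 + [1.3, 0.4]
density = activation_density(acts)
print(f"Activations Density {density:.3%}")  # prints "Activations Density 0.200%"
```

Under the same assumption, a density of 0.186% would correspond to the feature firing on roughly 1.86 of every 1,000 sampled token positions.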