Explanations
proper nouns and locations, particularly those related to sports or media
references to images and photography
Negative Logits
antit       -0.65
comple      -0.63
blinded     -0.62
Arbit       -0.59
arde        -0.59
tot         -0.59
Zar         -0.59
assurance   -0.58
footing     -0.58
propri      -0.58
Positive Logits
³³³³        0.85
BBC         0.85
³³³         0.83
ccording    0.82
Politics    0.81
WASHINGTON  0.80
SHARES      0.79
GREEN       0.78
³³³³³³³³    0.78
Python      0.77
Activations Density: 0.152%