INDEX
Explanations
mentions of names associated with political figures or public officials
proper nouns, particularly names and titles of public figures
Negative Logits
Avalon        -0.66
VID           -0.65
EVE           -0.60
Newtown       -0.60
Kodi          -0.58
Jericho       -0.55
skelet        -0.55
Connor        -0.54
subjective    -0.54
iHUD          -0.54
Positive Logits
confid        0.87
Äĩ            0.84
's            0.80
ervatives     0.80
zinski        0.76
Jinping       0.75
appointed     0.74
aide          0.74
tsy           0.73
omics         0.73
Activations Density 0.177%