Explanations
names or terms related to a specific person or character
mentions of specific names or entities, particularly last names
Negative Logits
Token            Logit
Scientology      -0.75
chemotherapy     -0.73
corners          -0.70
Commonwealth     -0.69
PBS              -0.66
Fairfax          -0.66
Playboy          -0.65
Telegraph        -0.63
unfocusedRange   -0.63
CBS              -0.63
Positive Logits
Token            Logit
Sak              1.31
amoto            1.26
urai             1.20
imura            1.11
yu               1.05
unin             1.00
rament           0.98
daq              0.98
arin             0.97
rat              0.96
Activations Density: 0.003%
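
The density figure above can be read as roughly 3 active token positions per 100,000. The sketch below shows one way such a number might be computed, assuming density means the fraction of token positions where the feature's activation is above zero. The page does not state the definition, so that reading, the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" is the share of positions with a nonzero
    activation. The dashboard itself does not define the term.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 3 active positions out of 100,000 tokens.
sample = [0.0] * 99_997 + [0.4, 1.2, 2.5]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.003%
```

A density this low is consistent with a feature that fires only on a narrow class of tokens, such as the name fragments listed under Positive Logits.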