INDEX
Explanations
proper nouns, specifically names of people and organizations
commas followed by proper nouns or names
New Auto-Interp
Negative Logits
Match
-0.64
hift
-0.62
ãĥ¥
-0.58
Rebirth
-0.58
Customers
-0.57
gaps
-0.55
Refresh
-0.54
Cause
-0.53
Compatibility
-0.53
Locations
-0.53
POSITIVE LOGITS
who
0.77
assassinated
0.76
who
0.75
whom
0.74
whose
0.71
Philippe
0.71
Abdullah
0.70
whose
0.70
responsible
0.69
told
0.69
Activations Density 0.746%