INDEX
Explanations
references to specific individuals, particularly names of people
mentions of people by common first names (proper given names), especially male names.
New Auto-Interp
Negative Logits
Reſ
-0.86
".
-0.82
Anſ
-0.82
TRIBUN
-0.82
(\<
-0.81
―――――
-0.79
)");
-0.78
')")
-0.77
)
-0.76
."</
-0.75
POSITIVE LOGITS
Ed
0.84
Chris
0.81
Tim
0.79
Mike
0.78
mybatisplus
0.76
Nick
0.76
Dave
0.76
Dave
0.75
gebob
0.74
Bob
0.74
Activations Density 0.173%