INDEX
Explanations
mentions of notable figures in entertainment and politics
Tokens preceding names
celebrity names
New Auto-Interp
Negative Logits
-0.75
the
-0.63
a
-0.60
,
-0.59
et
-0.57
.
-0.54
all
-0.53
it
-0.52
more
-0.51
'
-0.51
POSITIVE LOGITS
ⓧ
0.99
Anſ
0.99
abetes
0.98
0.98
mybatisplus
0.97
Efq
0.92
Diſ
0.92
Reſ
0.90
Administrativna
0.87
ArgsConstructor
0.87
Activations Density 0.228%