INDEX
Explanations
words related to specific historical and cultural entities, especially those pertaining to German-speaking regions and Lutheranism
Negative Logits
myſelf          -0.61
Behav           -0.56
Jefus           -0.55
LabelTagHelper  -0.54
جغرافيا         -0.52
évaluateur      -0.51
Retail          -0.50
везе            -0.50
ſeveral         -0.49
reaſon          -0.49
Positive Logits
findpost         0.74
Roskov           0.73
otomatig         0.71
Burnett          0.66
nakalista        0.66
Petra            0.62
Karl             0.60
Siemens          0.59
Lightboxes       0.59
PasswordEncoder  0.58
Activations Density: 2.652%