INDEX
Explanations
proper nouns, specifically names of individuals and their affiliations in news articles
New Auto-Interp
Negative Logits
newspapers
-0.45
AspNetCore
-0.45
ujednoznacz
-0.42
]-->
-0.42
Eloquent
-0.41
TextSpan
-0.41
gonic
-0.41
참고
-0.40
WithEmail
-0.40
Anya
-0.40
POSITIVE LOGITS
Tikang
0.73
transQ
0.72
قایناقلار
0.67
"])
0.63
ništ
0.61
'\\;'
0.59
RegressionTest
0.58
समीक्षाओं
0.57
}))
0.56
<=",
0.56
Activations Density 0.022%