INDEX
Explanations
phrases and structures that indicate comparative or relational contexts
Negative Logits
betweenstory      -0.91
wikipagina        -0.85
tartalomajánló    -0.78
leprosy           -0.75
JAXBElement       -0.74
Chaldean          -0.73
MDS               -0.70
dredge            -0.70
Viper             -0.69
NSCoder           -0.69
Positive Logits
no                0.64
                  0.57
있으며            0.57
major             0.57
wijl              0.56
having            0.55
prič              0.55
iendo             0.54
most              0.52
notable           0.52
Activations Density 0.190%