INDEX
Explanations
proper nouns, particularly names and initials associated with individuals or authors
New Auto-Interp
Negative Logits
Theſe
-0.62
Beſ
-0.56
contemporaine
-0.55
Jefus
-0.53
leaſt
-0.51
itſelf
-0.51
Continuar
-0.50
zke
-0.50
Houſe
-0.49
Monfieur
-0.49
POSITIVE LOGITS
ModelExpression
0.76
awtextra
0.74
tagHelperRunner
0.71
AndEndTag
0.70
BeginContext
0.70
webElementGuid
0.69
mouseY
0.69
AssemblyProduct
0.67
חיצוניים
0.67
]")]
0.66
Activations Density 0.327%