INDEX
Explanations
proper nouns and specific names in the text
Negative Logits
ModelExpression    -0.66
Chwiliwch          -0.55
kaarangay          -0.54
findpost           -0.48
enterOuterAlt      -0.48
IUrlHelper         -0.47
addCriterion       -0.47
invokingState      -0.46
anguages           -0.46
pozdrawiam         -0.45
Positive Logits
Fit       0.49
Fit       0.47
Her       0.42
Put       0.41
fach      0.41
Put       0.40
Nur       0.39
Kus       0.39
Firm      0.39
inaldi    0.38
Activations Density 0.192%