INDEX
Explanations
questions that express curiosity or seek information
New Auto-Interp
Negative Logits
enumi
-0.73
BorderSide
-0.69
enumii
-0.64
entanto
-0.62
FontWeight
-0.61
,
-0.58
цездатний
-0.57
AttributeSet
-0.57
CppMethod
-0.56
ècie
-0.56
POSITIVE LOGITS
?!?
0.91
And
0.82
↵↵
0.76
It
0.75
That
0.71
Or
0.68
particulières
0.68
termica
0.67
Probably
0.67
juridiques
0.66
Activations Density 0.134%