INDEX
Explanations
sentences structured around the subject "it" that contains assessments or claims
New Auto-Interp
Negative Logits
aise
-0.18
odont
-0.17
themselves
-0.16
499
-0.15
whom
-0.15
à¤īनà¤ķ
-0.15
himself
-0.14
their
-0.14
898
-0.14
oints
-0.14
POSITIVE LOGITS
itself
0.31
its
0.30
Its
0.27
Its
0.26
its
0.21
å®ĥ们
0.19
iner
0.17
ï¼Įå®ĥ
0.17
коÑĤоÑĢое
0.17
Ñıке
0.17
Activations Density 0.199%