Explanations
multiple instances of the word "this" or "these" in a text
Negative Logits
Token        Logit
LookAnd      -0.72
feroit       -0.64
adl          -0.60
FieldNumber  -0.56
◆◇           -0.55
ABUL         -0.52
ſever        -0.52
AllowUser    -0.51
TLR          -0.51
msgTypes     -0.51
Positive Logits
Token   Logit
this    1.05
THIS    1.02
this    0.98
This    0.97
This    0.97
THIS    0.90
questa  0.78
этой    0.75
these   0.73
dieser  0.73
Activations Density: 0.372%