Explanations
demonstrative pronouns, particularly "this" and "these."
Negative Logits
Token          Logit
Agamemnon      -0.75
Coff           -0.69
Lw             -0.69
ResponseDto    -0.68
Moe            -0.68
Moe            -0.66
Dubuque        -0.66
Verbs          -0.66
orszá          -0.65
enderror       -0.65
Positive Logits
Token          Logit
this           2.16
this           1.96
THIS           1.93
THIS           1.81
This           1.71
This           1.70
questa         1.46
questo         1.42
dieses         1.41
esta           1.37
Activations Density: 0.365%
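The density figure can be read as the share of sampled tokens on which the feature fires. The sketch below shows that calculation under stated assumptions: the source does not define the metric, so "active" is assumed here to mean an activation strictly above a threshold of zero. The function name `activation_density` and the toy data are hypothetical and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: a token counts as "active" when its activation is strictly
    greater than `threshold`. The dashboard's exact definition is not stated.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 1000 sampled tokens, 4 of which activate the feature.
sample = [0.0] * 996 + [1.2, 0.4, 3.1, 0.9]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.365% would mean the feature is active on roughly 3 or 4 of every 1000 sampled tokens.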