INDEX
Explanations
instances of the word "this" and related demonstrative terms
Negative Logits
Moe          -0.65
Lawton       -0.61
ResponseDto  -0.60
Agamemnon    -0.59
Coff         -0.57
Moe          -0.57
Malone       -0.56
ाष           -0.56
Auvergne     -0.56
Verbs        -0.56
Positive Logits
this         1.66
THIS         1.65
THIS         1.56
this         1.49
This         1.39
This         1.38
Этот         1.20
بهذا         1.19
dieses       1.17
denna        1.17
Activations Density 0.364%