Explanations
contextual references to the term "this."
this followed by code or punctuation
Negative Logits
Token        Logit
ſei          -0.79
<unused14>   -0.78
[@BOS@]      -0.78
<unused8>    -0.78
<unused68>   -0.78
<unused79>   -0.78
<unused43>   -0.78
<unused42>   -0.78
<unused23>   -0.78
<unused41>   -0.78
Positive Logits
Token        Logit
this         0.87
This         0.65
THIS         0.61
this         0.60
This         0.59
THIS         0.58
acest        0.48
self         0.45
diesem       0.45
acestui      0.45
Activations Density: 0.006%