INDEX
Explanations
the word "right" in various contexts
NEGATIVE LOGITS
RESULTS       -0.66
Cannot        -0.64
Mechdragon    -0.64
Ibid          -0.60
ously         -0.59
Deaths        -0.58
kel           -0.58
agos          -0.57
ega           -0.57
ATA           -0.56
POSITIVE LOGITS
right          3.46
RIGHT          2.84
right          2.60
Right          2.54
Right          2.40
wrong          1.54
correct        1.52
left           1.46
left           1.33
wrong          1.30
Activations Density: 0.049%