INDEX
Explanations
links or online references
instances of punctuation or commas in the text
New Auto-Interp
Negative Logits
chin
-0.59
Turns
-0.57
Opening
-0.53
Become
-0.52
Round
-0.52
unnecess
-0.51
Rounds
-0.50
!--
-0.49
Ground
-0.49
reach
-0.48
POSITIVE LOGITS
respectively
1.21
etc
0.91
meanwhile
0.85
whereas
0.84
wherein
0.82
which
0.81
however
0.81
albeit
0.77
although
0.75
rahim
0.72
Activations Density 0.325%