Explanations
instances of the period punctuation mark
Negative Logits
��          -0.67
ufact       -0.65
earch       -0.62
��          -0.61
carp        -0.60
sembly      -0.59
ometimes    -0.58
regor       -0.58
Cabin       -0.55
slate       -0.53
Positive Logits
ts           0.85
kb           0.79
arium        0.70
lli          0.70
chenko       0.69
kov          0.68
vice         0.67
kr           0.66
cks          0.64
lla          0.62
Activations Density 0.015%