INDEX
Explanations
instances where strong opinions or criticisms are expressed
New Auto-Interp
Negative Logits
thood
-0.74
utm
-0.69
reve
-0.68
suppose
-0.68
ciplinary
-0.66
craft
-0.66
Malley
-0.66
Iter
-0.65
FontSize
-0.64
thereof
-0.63
POSITIVE LOGITS
same
1.62
hardest
1.38
slightest
1.30
brunt
1.25
ses
1.25
entirety
1.21
toughest
1.20
same
1.20
fastest
1.13
entire
1.13
Activations Density 0.225%