INDEX
Explanations
instances of phrases indicating an excessive amount or extent of something
New Auto-Interp
Negative Logits
inarily
-0.69
smith
-0.65
iens
-0.62
season
-0.58
elfth
-0.57
igi
-0.56
emet
-0.55
Except
-0.55
Circuit
-0.55
DIT
-0.55
POSITIVE LOGITS
much
1.09
many
1.02
Much
0.87
Much
0.81
close
0.77
Many
0.76
cheaply
0.76
far
0.75
len
0.75
ls
0.75
Activations Density 0.044%