Explanations
comparative expressions indicating an increase in quantity or degree
comparative phrases indicating magnitude or frequency
Negative Logits
Token        Logit
Origins      -0.68
Mushroom     -0.67
Dialog       -0.66
����         -0.65
PLA          -0.64
ーティ       -0.63
Puppet       -0.62
ズ           -0.61
76561        -0.61
AMA          -0.60

Positive Logits
Token        Logit
pired         0.82
far           0.81
leep          0.79
much          0.77
fast          0.77
itzer         0.77
expensive     0.74
hard          0.72
potent        0.71
schild        0.71
Activations Density 0.041%
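
As a rough illustration of the density figure, here is a minimal sketch assuming that density means the fraction of sampled token positions on which the feature's activation is above zero. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and chosen only to reproduce a value of the same size; they are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" is the share of sampled token positions on which
    the feature fires at all (activation strictly above the threshold).
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 41 of 100,000 sampled tokens.
sample = [1.5] * 41 + [0.0] * (100_000 - 41)
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.041%
```

Under this reading, a density of 0.041% would mean the feature is sparse, firing on roughly 4 of every 10,000 tokens.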