INDEX
Explanations
comparisons or statements emphasizing the ease of a particular action
sentences that emphasize the concept of ease or simplicity in various contexts
New Auto-Interp
Negative Logits
ulations
-0.70
Griffith
-0.69
inals
-0.68
anuts
-0.67
irrel
-0.66
Anthem
-0.64
alian
-0.62
Nether
-0.61
Ward
-0.61
lt
-0.60
POSITIVE LOGITS
than
1.33
Than
0.95
than
0.93
"$:/
0.91
forgiving
0.84
compr
0.78
toget
0.74
manageable
0.71
prey
0.68
é¾įå¥ij士
0.65
Activations Density 0.023%