INDEX
Explanations
phrases expressing comparisons
phrases indicating a balance of circumstances or decisions
Negative Logits
iliated     -0.73
uilt        -0.72
affiliated  -0.71
imentary    -0.70
etheless    -0.70
lav         -0.68
sequently   -0.67
ordable     -0.67
icago       -0.67
tions       -0.67
Positive Logits
roses   0.89
Roses   0.83
sheep   0.78
Fool    0.77
Sheep   0.75
Throne  0.74
Horses  0.72
cake    0.72
fools   0.69
Mouse   0.68
Activations Density: 1.038%