INDEX
Explanations
phrases indicating intention or plans to do something
phrases that indicate a specific type of group or categorization
New Auto-Interp
Negative Logits
abases
-0.76
terness
-0.73
Mub
-0.69
underest
-0.67
abilities
-0.66
flaws
-0.65
!/
-0.65
amounts
-0.64
asley
-0.64
erity
-0.63
POSITIVE LOGITS
centerpiece
0.99
viable
0.85
cohesive
0.82
reality
0.80
seamless
0.79
permanent
0.79
compulsory
0.78
Ħ¢
0.77
profitable
0.77
worthwhile
0.77
Activations Density 0.191%