INDEX
Explanations
phrases mentioning common sense or commonality
references to "common sense" and its variations in different contexts
New Auto-Interp
Negative Logits
Become
-0.74
zona
-0.74
asus
-0.72
endas
-0.70
letes
-0.69
atoon
-0.67
hetamine
-0.66
otos
-0.66
agate
-0.66
awaru
-0.65
POSITIVE LOGITS
wealth
1.63
alities
1.41
ality
1.28
denomin
1.26
places
1.17
place
1.02
ancestor
1.02
sense
1.00
decency
0.97
alties
0.91
Activations Density 0.034%