INDEX
Explanations
phrases related to the concept of common sense
references to "common sense" in various contexts
New Auto-Interp
Negative Logits
atern
-0.74
leased
-0.71
boxing
-0.70
soon
-0.70
rose
-0.65
ench
-0.64
ISS
-0.64
Stars
-0.62
usky
-0.60
ouched
-0.60
POSITIVE LOGITS
smanship
0.96
approach
0.88
abl
0.83
prud
0.81
prudent
0.80
prevailed
0.80
prag
0.79
Approach
0.76
ensical
0.76
decency
0.74
Activations Density 0.119%