INDEX
Explanations
mentions of personal knowledge or beliefs
references to personal experiences and opinions
Negative Logits
moderation      -0.73
smashing        -0.69
masc            -0.68
menstrual       -0.67
extravag        -0.64
festive         -0.64
interstitial    -0.63
gren            -0.61
scra            -0.61
Interstitial    -0.60
Positive Logits
know            1.73
know            1.62
KNOW            1.58
Know            1.54
Know            1.53
knows           1.50
knew            1.49
knowledge       1.35
knowing         1.35
knowledge       1.24
Activations Density: 0.286%