INDEX
Explanations
mentions of people knowing or being aware of something
instances of knowledge or awareness about situations or individuals
New Auto-Interp
Negative Logits
livion
-0.67
ramid
-0.66
Featured
-0.65
topp
-0.64
enture
-0.64
uctions
-0.63
detrim
-0.62
contention
-0.61
dissolution
-0.61
Rog
-0.61
POSITIVE LOGITS
KNOW
0.74
Knowing
0.72
Detect
0.72
knowledge
0.68
Knowing
0.67
vez
0.67
Know
0.66
Plain
0.66
Knowledge
0.66
intuitive
0.65
Activations Density 0.586%