INDEX
Explanations
phrases related to direct eye contact
the word "the" in various contexts
New Auto-Interp
Negative Logits
arians
-0.73
olicy
-0.71
fn
-0.70
Reason
-0.67
ulner
-0.65
mania
-0.65
ional
-0.65
soever
-0.64
Topics
-0.63
acca
-0.62
POSITIVE LOGITS
midst
1.55
vicinity
1.20
meantime
1.18
aftermath
1.09
guise
1.07
middle
1.04
same
1.03
context
1.02
slightest
0.98
wake
0.95
Activations Density 0.355%