INDEX
Explanations
phrases expressing opinions or thoughts
expressions of belief or anticipation regarding various topics
New Auto-Interp
Negative Logits
panic
-0.75
Shape
-0.73
Usage
-0.69
Individual
-0.67
Applic
-0.65
Attributes
-0.65
soever
-0.64
stroke
-0.62
thinkable
-0.62
pex
-0.62
POSITIVE LOGITS
ourselves
1.16
ours
0.77
our
0.75
parted
0.72
together
0.71
tongues
0.66
Braz
0.64
mutually
0.62
yg
0.62
nesday
0.61
Activations Density 0.373%