INDEX
Explanations
phrases or sequences of words related to proper nouns such as movie titles, place names, and technology terms
sentences or phrases that signify a concluding statement
New Auto-Interp
Negative Logits
ozyg
-0.79
involuntary
-0.66
thouse
-0.65
advis
-0.64
oak
-0.64
inous
-0.62
idle
-0.62
bullied
-0.61
unch
-0.61
barric
-0.61
POSITIVE LOGITS
respectively
1.30
These
1.29
Both
1.23
Each
1.23
Together
1.21
Also
1.19
They
1.19
Basically
1.17
Additionally
1.12
Those
1.10
Activations Density 0.540%