INDEX
Explanations
the word "of" occurring in various contexts in the text
New Auto-Interp
Negative Logits
cycles
-0.81
nels
-0.74
photos
-0.71
··
-0.70
Components
-0.67
oons
-0.66
cles
-0.64
usky
-0.64
ktop
-0.63
asar
-0.63
POSITIVE LOGITS
encouragement
1.32
caution
1.11
advice
1.10
wisdom
1.08
recommendation
1.05
warning
1.00
congratulations
0.97
mouth
0.95
condemnation
0.93
appreciation
0.93
Activations Density 0.080%