INDEX
Explanations
phrases related to composition or makeup
references to components or parts of a whole
New Auto-Interp
Negative Logits
ne
-0.62
EE
-0.59
ii
-0.58
ubb
-0.57
valve
-0.57
fly
-0.57
bye
-0.57
pet
-0.55
poker
-0.55
bury
-0.55
POSITIVE LOGITS
comprises
1.13
comprising
1.10
comprised
1.09
ngth
0.91
consists
0.83
consisted
0.82
comprise
0.80
itialized
0.79
ivities
0.79
clusion
0.79
Activations Density 0.007%