Explanations
phrases indicating collaboration and collective efforts
Negative Logits
alla          -0.16
baise         -0.15
ritis         -0.14
Discussion    -0.14
nici          -0.14
oning         -0.14
hur           -0.13
ernel         -0.13
Combo         -0.13
tangled       -0.13
Positive Logits
compile       0.37
collected     0.36
gathered      0.36
compiling     0.35
collects      0.35
compilation   0.35
compiled      0.34
gathering     0.34
collecting    0.33
gather        0.33
Activations Density 0.167%