INDEX
Explanations
words related to collaboration and co-authorship
New Auto-Interp
Negative Logits
Monfieur
-1.10
Efq
-0.85
ValueGeneration
-0.77
Anſ
-0.76
ſeveral
-0.76
Diſ
-0.74
umman
-0.73
Eſ
-0.72
Verſ
-0.72
Chriſt
-0.72
POSITIVE LOGITS
co
1.32
Co
0.99
coop
0.88
Co
0.82
спів
0.81
Coop
0.80
Coop
0.77
co
0.75
jointly
0.75
spolu
0.74
Activations Density 0.056%