INDEX
Explanations
references to specific events or gatherings
Negative Logits
Merit       -0.16
Pregn       -0.16
andas       -0.15
_DIP        -0.14
ivé         -0.14
mond        -0.14
pregnant    -0.14
jer         -0.14
auc         -0.14
mann        -0.13
Positive Logits
826         0.17
ichen       0.16
elper       0.15
bearing     0.15
Gone        0.15
AFX         0.14
æ·          0.14
yst         0.14
μαÏĦο       0.14
omba        0.14
Activations Density: 0.005%