INDEX
Explanations
phrases related to maintenance or continuity
New Auto-Interp
Negative Logits
-1.06
ValueStyle
-0.95
Portale
-0.85
OGND
-0.81
Personensuche
-0.80
geldt
-0.76
Adamson
-0.75
Dorsey
-0.74
băr
-0.71
ModelRenderer
-0.71
POSITIVE LOGITS
keep
1.42
KEEP
1.39
kept
1.34
keep
1.34
Keeps
1.32
KEEP
1.32
Keep
1.30
Keep
1.24
keeps
1.24
keeps
1.18
Activations Density 0.045%