INDEX
Explanations
phrases related to results or consequences
references to outcomes in various contexts
New Auto-Interp
Negative Logits
cer
-0.82
nan
-0.77
elong
-0.71
ondo
-0.70
vor
-0.70
ju
-0.69
ker
-0.69
bia
-0.69
entin
-0.68
azines
-0.68
POSITIVE LOGITS
outcome
1.38
outcomes
1.13
bringer
0.76
Result
0.76
result
0.74
icter
0.71
DragonMagazine
0.70
ilater
0.69
bystanders
0.68
winner
0.68
Activations Density 0.008%