INDEX
Explanations
specific references to individuals and their connections or actions within a narrative
New Auto-Interp
Negative Logits
)");
-0.89
"]);
-0.86
NUMX
-0.86
."]
-0.84
".
-0.81
"]();
-0.79
"));
-0.79
مشين
-0.78
.";
-0.78
`]
-0.78
POSITIVE LOGITS
mockito
0.49
famous
0.47
знамени
0.47
פור
0.45
toHave
0.45
featured
0.43
jsii
0.42
famously
0.42
<<<<<<<<<<<<<<
0.41
for
0.41
Activations Density 0.553%