INDEX
Explanations
phrases indicating the source of information
New Auto-Interp
Negative Logits
kasarigan
-0.81
cokinetics
-0.75
OnTrigger
-0.70
IGraphics
-0.66
깐
-0.66
saites
-0.66
timeScale
-0.65
jsPsych
-0.63
MigrationBuilder
-0.62
rolex
-0.62
POSITIVE LOGITS
FROM
1.34
FROM
1.30
From
1.21
from
1.17
From
1.16
from
1.14
từ
1.05
desde
1.03
desde
1.01
Từ
0.99
Activations Density 0.158%