INDEX
Explanations
references to numerical data related to various categories or subjects
phrases related to statistical results or numerical data summaries
New Auto-Interp
Negative Logits
trave
-0.62
Safari
-0.58
士
-0.56
adhere
-0.56
Progress
-0.55
Presence
-0.55
realization
-0.54
metab
-0.52
reacts
-0.52
demonstration
-0.52
POSITIVE LOGITS
eight
1.26
nine
1.26
thirteen
1.24
seven
1.24
fourteen
1.23
six
1.22
eleven
1.21
sixteen
1.20
nineteen
1.18
seventeen
1.17
Activations Density 0.070%