INDEX
Explanations
references to user actions and interactions within a digital platform
New Auto-Interp
Negative Logits
amer
-0.72
Baptist
-0.70
Lutheran
-0.70
von
-0.63
stress
-0.61
hovah
-0.61
BJ
-0.61
Tempest
-0.61
Scand
-0.60
Vaugh
-0.60
POSITIVE LOGITS
interface
1.06
interfaces
1.01
interface
0.96
pace
0.95
Interface
0.92
base
0.91
cript
0.87
Agent
0.82
Interface
0.81
ifest
0.77
Activations Density 0.454%