INDEX
Explanations
references to sports teams and player statistics
New Auto-Interp
Negative Logits
.LookAndFeel
-0.15
adar
-0.15
saddle
-0.14
.tt
-0.14
ëª
-0.14
pired
-0.14
former
-0.14
necessity
-0.13
everyday
-0.13
ivering
-0.13
POSITIVE LOGITS
tend
0.52
tends
0.45
tended
0.36
likes
0.32
tendency
0.31
love
0.30
loves
0.29
likes
0.27
Likes
0.25
liking
0.25
Activations Density 0.067%