Explanations
statements or predictions made by individuals
Negative Logits
originals     -0.72
addons        -0.64
disciplines   -0.63
Administ      -0.63
ynski         -0.63
endars        -0.63
azeera        -0.63
issions       -0.62
Models        -0.61
Cosponsors    -0.61
Positive Logits
:             1.12
:'            1.05
whereby       1.03
:[            1.02
:-            0.97
!:            0.96
:"            0.96
:(            0.94
:#            0.93
:,            0.91
Activations Density 0.266%