Explanations
references to political processes and their implications
Negative Logits
Token        Logit
,            -0.81
and          -0.64
EndProject   -0.64
as           -0.61
but          -0.60
והוא         -0.52
Референце    -0.52
があり         -0.52
ніципа       -0.50
because      -0.50
Positive Logits
Token        Logit
purpoſe      0.95
ſever        0.95
themſelves   0.94
ſche         0.94
Conſ         0.93
pleaſure     0.90
itſelf       0.89
ſmall        0.87
unſ          0.86
Reſ          0.86
Activations Density: 1.704%