INDEX
Explanations
references to historical events and data
Negative Logits
tartalomajánló    -0.84
(_,               -0.80
kasarigan         -0.79
Efq               -0.78
Leyden            -0.78
//                -0.78
bkz               -0.77
Reeve             -0.76
للمعارف           -0.76
חיצוני            -0.76
Positive Logits
although          0.67
but               0.59
while             0.57
потому            0.55
2                 0.54
4                 0.53
as                0.52
5                 0.51
0                 0.50
particularly      0.49
Activations Density 0.828%