INDEX
Explanations
numerical data representing significant statistics or measurements
New Auto-Interp
Negative Logits
1
-0.47
2
-0.35
poderosos
-0.32
sourire
-0.31
années
-0.31
8
-0.31
cercanos
-0.30
Freundin
-0.30
5
-0.30
7
-0.29
POSITIVE LOGITS
ロウィン
0.89
AspNetCore
0.84
ſchaft
0.84
nahilalakip
0.84
الحره
0.82
<unused52>
0.82
<unused41>
0.81
contentLoaded
0.81
<unused28>
0.81
[@BOS@]
0.81
Activations Density 0.212%