INDEX
Explanations
No Explanations Found
Negative Logits
IB            0.81
ek            0.76
heritage      0.72
costituito    0.71
ia            0.68
ec            0.68
fac           0.66
OS            0.66
het           0.65
ett           0.64
Positive Logits
ﺷ             0.97
ార            0.87
ﻣ             0.85
чный          0.84
впервые       0.84
LMP           0.83
رید           0.82
Aram          0.82
Middlesbrough 0.81
основном      0.80
Activations Density 0.000%