INDEX
Explanations
repeated mentions of one specific individual, Wilson
Negative Logits
枚                -0.55
MD                -0.52
ագ                -0.49
opts              -0.48
dtd               -0.48
od                -0.48
yere              -0.48
arda              -0.47
Kabir             -0.47
beri              -0.47
Positive Logits
Wilson            0.92
SerializedSize    0.89
lung              0.86
coal              0.85
Wilson            0.81
Coal              0.78
wilson            0.77
WILSON            0.77
فريبيس            0.75
Roskov            0.74
Activations Density 0.079%