INDEX
Negative Logits
religion
-0.91
bread
-0.89
tagHelperRunner
-0.78
religion
-0.77
posedge
-0.70
gynhyrchwyd
-0.69
InputBorder
-0.68
RELIGION
-0.68
Religion
-0.67
httphttps
-0.62
POSITIVE LOGITS
ing
0.70
ubility
0.62
舍
0.61
the
0.60
ting
0.56
ING
0.55
ton
0.54
'
0.53
.=
0.52
.
0.52
Activations Density 0.046%