INDEX
Explanations
references to New Jersey
New Auto-Interp
Negative Logits
Buxton
-0.74
المكان
-0.71
occasione
-0.67
baht
-0.64
rückt
-0.63
انجليز
-0.62
Branson
-0.61
audiovisuel
-0.60
onauts
-0.58
kover
-0.57
POSITIVE LOGITS
Jersey
2.31
Jersey
1.97
JERSEY
1.79
jersey
1.45
jersey
1.43
NJ
1.38
NJ
1.13
JER
0.85
jerseys
0.85
nj
0.81
Activations Density 0.060%