INDEX
Explanations
references to the city of Toronto
occurrences of the word "Toronto"
New Auto-Interp
Negative Logits
mble
-0.86
hardt
-0.80
bler
-0.76
vana
-0.75
hard
-0.74
ktop
-0.71
warts
-0.70
vier
-0.70
inelli
-0.70
ãĤ©
-0.70
POSITIVE LOGITS
Raptors
0.97
Argon
0.91
Harbour
0.91
Heights
0.86
Toronto
0.85
Maple
0.82
Centre
0.82
skyline
0.79
Citiz
0.75
FC
0.74
Activations Density 0.010%