INDEX
Explanations
references to gangs and gang-related terminology
New Auto-Interp
Negative Logits
"}")
-0.54
"}},
-0.48
"](
-0.47
存于互联网档案馆
-0.47
}")]
-0.46
')))
-0.44
addGap
-0.44
"])
-0.43
')")
-0.43
}))
-0.43
POSITIVE LOGITS
Gang
1.14
gang
1.08
Gang
1.06
gangs
1.02
feroit
0.91
ainfi
0.89
gang
0.87
gangsters
0.76
gangster
0.75
pouvoit
0.73
Activations Density 0.005%