INDEX
Explanations
mentions of political figures and their administrations
New Auto-Interp
Negative Logits
rab
-0.47
RUnlock
-0.43
ржа
-0.41
قي
-0.39
aux
-0.39
一度
-0.39
cue
-0.39
INTERESAR
-0.38
IActionResult
-0.38
Somehow
-0.38
POSITIVE LOGITS
purpoſe
0.86
newVal
0.81
leſs
0.81
InputTagHelper
0.80
houſe
0.74
pleaſure
0.74
kasarigan
0.74
Numerade
0.72
iastes
0.72
StoryboardSegue
0.71
Activations Density 0.345%