INDEX
Explanations
phrases related to various specific communities or groups
the word "in" and its frequency within context
New Auto-Interp
Negative Logits
enza
-0.72
Completed
-0.64
0000000000000000
-0.63
rifice
-0.62
:/
-0.61
ograms
-0.61
dule
-0.60
iris
-0.60
discrimination
-0.57
iour
-0.57
POSITIVE LOGITS
academia
1.35
Silicon
1.02
circles
0.98
Congress
0.97
fandom
0.94
Europe
0.93
Washington
0.93
roads
0.85
forums
0.83
mainstream
0.83
Activations Density 0.191%