INDEX
Explanations
technical terms and jargon
mentions of disagreements, controversies, or disputes
New Auto-Interp
Negative Logits
touches
-0.76
shroud
-0.71
nude
-0.71
collecting
-0.69
reception
-0.68
Borough
-0.68
clad
-0.68
weights
-0.66
Downs
-0.66
buggy
-0.65
POSITIVE LOGITS
¬
1.02
¹
1.00
£
1.00
Ĥ
1.00
»
0.96
º
0.95
į
0.94
½
0.93
ONSORED
0.92
ķ
0.92
Activations Density 0.131%