INDEX
Explanations
terms related to quarantine and restrictions
Negative Logits
Token    Logit
ian      -0.17
iga      -0.17
454      -0.17
igid     -0.17
sal      -0.16
hin      -0.16
iate     -0.15
enia     -0.15
ging     -0.15
451      -0.15
Positive Logits
Token    Logit
Quar     0.29
quar     0.28
antine   0.28
/qu      0.22
rels     0.21
qu       0.21
-qu      0.21
Qu       0.17
tern     0.17
_:*      0.17
Activations Density: 0.006%