INDEX
Explanations
phrases related to restrictions and regulations
NEGATIVE LOGITS
kit          -0.14
ους          -0.14
Explosion    -0.14
روس          -0.14
stim         -0.14
Stim         -0.13
Opr          -0.13
iy           -0.13
Kits         -0.13
ibbon        -0.13
POSITIVE LOGITS
imposed          0.33
lifted           0.29
implemented      0.29
applied          0.28
imposition       0.27
Lift             0.27
orders           0.26
implemented      0.26
implementation   0.26
Implemented      0.26
Activations Density 0.073%
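
As a rough illustration of how a density figure like this could be read, the sketch below assumes "Activations Density" means the fraction of sampled token positions on which the feature fires (activation above zero). The page itself does not state this definition. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumption: density = (positions where the feature fires) / (all positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample, not data from this page:
# 100,000 token positions, 73 of which have a nonzero activation.
sample = [0.0] * 99_927 + [1.0] * 73
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.073%
```

Under that assumption, a density of 0.073% would mean the feature fires on roughly 7 of every 10,000 tokens, which is consistent with a sparse feature tied to specific wording.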