Explanations
phrases related to rules, regulations, or measures being implemented
the phrase "in place" or variations of it
Negative Logits
Token       Logit
ensor       -0.68
cade        -0.65
iday        -0.62
mini        -0.61
forgiven    -0.60
idays       -0.60
nic         -0.60
ieth        -0.59
ragon       -0.59
spo         -0.59
Positive Logits
Token       Logit
place       1.52
effect      1.16
place       1.05
activated   1.00
existence   1.00
effect      0.97
effic       0.95
operative   0.92
ked         0.90
operation   0.85
Activations Density: 0.170%