Explanations
phrases that refer to individuals or entities involved in actions or regulations
Negative Logits
DragonMagazine    -0.74
Blossom           -0.71
Lauder            -0.69
Ange              -0.67
roses             -0.65
aisle             -0.65
dust              -0.62
Candle            -0.62
Gor               -0.62
Medals            -0.61
Positive Logits
knowingly         1.21
willfully         1.16
unlawfully        1.11
violates          1.05
improperly        1.04
violate           1.03
inadvertently     0.97
incorrectly       0.94
fail              0.94
jeopard           0.93
Activations Density 0.365%