INDEX
Explanations
the word 'helps'
phrases indicating changes, restrictions, or consequences in contexts involving actions or statements
New Auto-Interp
Negative Logits
Panama
-0.84
furt
-0.79
sburg
-0.76
PAN
-0.75
Franc
-0.75
gart
-0.74
BART
-0.73
isman
-0.73
ão
-0.73
urger
-0.72
POSITIVE LOGITS
Sy
1.95
Dy
1.74
Ly
1.73
Cy
1.65
Sy
1.58
sy
1.55
Ly
1.50
Ty
1.45
SY
1.45
sy
1.44
Activations Density 0.301%