INDEX
Explanations
references to limits, restrictions, or numerical thresholds, often related to policies or regulations
references to limits or thresholds, particularly in a financial or regulatory context
New Auto-Interp
Negative Logits
ĪĴ
-0.87
hower
-0.74
selves
-0.73
cause
-0.73
cipl
-0.73
isance
-0.68
Bread
-0.66
VD
-0.66
perse
-0.65
Roses
-0.63
POSITIVE LOGITS
itol
1.21
aic
0.93
itals
0.87
illary
0.86
rison
0.82
stan
0.78
itated
0.78
aign
0.76
uchin
0.75
acious
0.74
Activations Density 0.012%