INDEX
Explanations
expressions related to safety, well-being, and family reunification
phrases related to safety and reunion
New Auto-Interp
Negative Logits
eyebrow
-0.77
inference
-0.77
bullish
-0.71
forecasting
-0.69
fielded
-0.69
objection
-0.69
estimating
-0.67
bluff
-0.66
antitrust
-0.66
ado
-0.65
POSITIVE LOGITS
alive
1.21
forever
1.10
someday
0.96
uncond
0.96
captivity
0.92
peacefully
0.92
folk
0.89
forts
0.86
arest
0.86
rieve
0.85
Activations Density 0.452%