INDEX
Explanations
words related to limitations, restrictions, or boundaries
references to limitations or restrictions
New Auto-Interp
Negative Logits
estone
-0.73
Honour
-0.71
uni
-0.68
joy
-0.67
story
-0.67
mberg
-0.67
tein
-0.65
hod
-0.63
************
-0.63
psc
-0.62
POSITIVE LOGITS
constraints
1.24
constraint
1.11
constrained
1.01
imposed
0.92
restraints
0.90
dictates
0.88
pressures
0.86
cooker
0.84
besie
0.81
restricts
0.76
Activations Density 0.010%