INDEX
Explanations
terms related to wide-reaching effects, actions, or conditions
terms related to broad associations or generalizations within a specific context
Negative Logits
oleon   -0.83
wang    -0.79
pload   -0.79
mass    -0.77
irlf    -0.76
mob     -0.76
GUI     -0.75
mat     -0.74
arte    -0.74
duct    -0.74
Positive Logits
condemnation    0.77
closure         0.70
inspection      0.69
interventions   0.69
emergency       0.67
enhancements    0.67
closures        0.67
solutions       0.66
continuum       0.65
omn             0.65
Activations Density: 0.137%