Explanations
instances of one entity providing something to another
actions related to giving or providing something
Negative Logits
psc           -0.75
inav          -0.62
urities       -0.61
imagin        -0.61
collisions    -0.59
WATCHED       -0.58
ski           -0.58
.","          -0.57
compr         -0.57
LINE          -0.57
Positive Logits
assurances    0.89
us            0.87
him           0.86
them          0.78
AMA           0.73
refunds       0.70
condolences   0.70
directions    0.69
me            0.66
HK            0.66
Activations Density 0.200%