INDEX
Explanations
words related to preliminary or initial information
words related to relationships and connections between individuals
New Auto-Interp
Negative Logits
çīĪ
-0.68
Schwar
-0.67
Schr
-0.63
catentry
-0.62
OWS
-0.56
ALLY
-0.56
OLD
-0.56
enegger
-0.55
HQ
-0.55
çͰ
-0.54
POSITIVE LOGITS
iminary
0.86
otal
0.77
emic
0.75
umatic
0.74
asant
0.73
emouth
0.73
odon
0.71
iasis
0.70
ient
0.68
eline
0.68
Activations Density 0.103%