INDEX
Explanations
specific named entities like business names, locations, and titles
prominent names and phrases related to institutions or services
New Auto-Interp
Negative Logits
tremend
-0.72
çͰ
-0.72
deficit
-0.72
streng
-0.71
carbohyd
-0.69
sacrific
-0.68
steroids
-0.66
bowel
-0.65
predec
-0.64
punishing
-0.64
POSITIVE LOGITS
âĦ¢
0.91
ibrary
0.82
Girl
0.82
Jr
0.81
@
0.79
Knight
0.79
()
0.78
wordpress
0.77
Profile
0.77
List
0.76
Activations Density 0.209%