INDEX
Explanations
abbreviations used in a specific context
references to specific sections or aspects of a structured document or report
New Auto-Interp
Negative Logits
sbm
-0.69
subord
-0.68
taboola
-0.67
behold
-0.66
naires
-0.66
omorphic
-0.61
omorph
-0.61
ãĥĥãĥĪ
-0.59
hof
-0.59
hower
-0.59
POSITIVE LOGITS
EED
1.42
OIL
1.30
ORT
1.15
ORTS
1.14
ONSORED
1.11
ECT
1.10
IR
1.02
HER
0.98
ERSON
0.96
iral
0.95
Activations Density 0.019%