
|
Semantic Annotation of SF knowledge bases
|
For the
effective management of SF knowledge, we are currently in the process
of organizing LFs according to their semantic categories. We adopted
eight semantic groups below, derived from the
UMLS semantic groups
[McCray et al. 2001]. For assignment of
semantic groups, we exploited existing knowledge sources in the domain
for the training of supervised machine learning model (see [Liu et al.
2001]). Currently, as a semantic group assignment system,
we use a maximum
entropy model [MaxEnt] trained over (a part of) the entries in the
ADAM database [Zhou et al. 2006] that we
automatically annotated with semantic groups according to the
UMLS [Bodenreider, 2006].
While the model considers with contextual features as well as phrase
internal features (see [Torii et al. 2005]), only the phrase internal
clues are used for this web demo.
|
|
Groups
|
UMLS
Semantic Types
|
|
ANAT
|
Anatomical
Structure, Body Location or Region, Body Part, Organ, or Organ
Component, Body Space or Junction, Body Substance, Body System, Cell,
Cell Component, Embryonic Structure, Fully Formed Anatomical Structure,
Tissue
|
|
CHEM
|
Amino
Acid Sequence, Amino Acid, Peptide, or Protein, Antibiotic,
Biologically Active Substance, Biomedical or Dental Material,
Carbohydrate, Carbohydrate Sequence, Chemical, Chemical Viewed
Functionally, Chemical Viewed Structurally, Clinical Drug, Eicosanoid,
Element, Ion, or Isotope, Enzyme, Gene or Genome, Hazardous or
Poisonous Substance, Hormone, Immunologic Factor, Indicator, Reagent,
or Diagnostic Aid, Inorganic Chemical, Lipid, Molecular Sequence,
Neuroreactive Substance or Biogenic Amine, Nucleic Acid, Nucleoside, or
Nucleotide, Nucleotide Sequence, Organic Chemical, Organophosphorus
Compound, Pharmacologic Substance, Receptor, Steroid, Vitamin
|
|
CONC
|
Classification,
Conceptual Entity, Functional Concept, Group Attribute, Idea or
Concept, Intellectual Product, Language, Qualitative Concept,
Quantitative Concept, Regulation or Law, Spatial Concept, Temporal
Concept
|
|
DISO
|
Acquired
Abnormality, Anatomical Abnormality, Cell or Molecular Dysfunction,
Congenital Abnormality, Disease or Syndrome, Experimental Model of
Disease, Finding, Injury or Poisoning, Mental or Behavioral
Dysfunction, Neoplastic Process, Pathologic Function, Sign or Symptom
|
|
LIVB
|
Age
Group, Alga, Amphibian, Animal, Archaeon, Bacterium, Bird, Family
Group, Fish, Fungus, Group, Human, Invertebrate, Mammal, Organism,
Patient or Disabled Group, Plant, Population Group, Professional or
Occupational Group, Reptile, Rickettsia or Chlamydia, Vertebrate, Virus
|
|
PHYS
|
Cell
Function, Clinical Attribute, Genetic Function, Mental Process,
Molecular Function, Organ or Tissue Function, Organism Attribute,
Organism Function, Physiologic Function
|
|
PROC
|
Diagnostic
Procedure, Educational Activity, Health Care Activity, Laboratory
Procedure, Molecular Biology Research Technique, Research Activity,
Therapeutic or Preventive Procedure
|
|
OTHERS
|
Activity,
Behavior, Biologic Function, Biomedical Occupation or Discipline, Daily
or Recreational Activity, Drug Delivery Device, Entity, Environmental
Effect of Humans, Event, Food, Geographic Area, Governmental or
Regulatory Activity, Health Care Related Organization, Human-caused
Phenomenon or Process, Individual Behavior, Laboratory or Test Result,
Machine Activity, Manufactured Object, Medical Device, Natural
Phenomenon or Process, Occupation or Discipline, Occupational Activity,
Organization, Phenomenon or Process, Physical Object, Professional
Society, Research Device, Self-help or Relief Organization, Social
Behavior, Substance
|
|
Bodenreider O. (2004) The Unified
Medical Language System (UMLS): integrating biomedical terminology. Nucleic Acids Res. 32(Database issue):D267-270.
Liu H, Lussier Y and Friedman C (2001) Disambiguating Biomedical Terms in Biomedical Narrative Text: an Unsupervised Journal of Biomedical Informatics, Elsevier Ltd., 34 (4): 249-61.
MaxEnt.http://maxent.sourceforge.net/.
McCray
AT, Burgun A, Bodenreider O (2001) Aggregating UMLS semantic
types for
reducing conceptual complexity. Proceedings of Medinfo;10(Pt
1):216-20.
Torii M, Kamboj S and Vijay-Shanker K (2005) Using name-internal and contextual features to classify biological terms. Journal of Biomedical Informatics, 37, pp.498–511.
Torii
M and Liu H (2006) Headwords and Suffixes in Biomedical
Names. In proc of Knowledge Discovery in Life Science
Literature (KDLL), pp.29-41.
Zhou W, Torvik VI, Smalheiser NR (2006) ADAM: another database of
abbreviations in MEDLINE. Bioinformatics;
22(22): 2813-2818.
|
|
|
Liu
Lab, Building D, Room 175, 4000 Reservoir Rd NW, Washington, DC
20007 | Phone: 202.687.7933 |Last updated:
February 26, 2007
|
|