Urbigene

Content

What is FOAF and how can I use it ?
What is SciFOAF ?
What is MyFOAFExplorer ?
Download
Building
Using SciFOAF and MyFOAFExplorer
References
Links

What is FOAF and how can I use it ?

The FOAF project defines a semantic format based on RDF/XML to define persons or groups, their relationships, as well as their basic properties such as name, e-mail address, subjects of interest, publication, and so on...

What is SciFOAF ?

SciFOAF is an interface used to generate a FOAF file using NCBI Pubmed. The PubMed bibliographic database is a powerful source to discover collaboration networks as each paper submitted contains potential information about each author. An interface called SciFOAF (Scientific FOAF) was written to generate a FOAF file from the literature found in PubMed. It is initialized with a PubMed identifier (PMID) that is used as a seed to start browsing a collaboration network and the user is asked to select the authors of this paper that will appear in the FOAF file. Authors are stored in a 'stack' and the user interaction will be required until this stack is empty. For each person in the stack complementary information (such as e-mail, web site...) can be fulfilled and his latest publications are fetched from NCBI using the NCBI E-Utilities tool. Then, for each article the user is asked if the current person is unambiguously one of the authors and whether he knows any of the co-authors (which may not be necessarily the case when there is a large number of collaborators) and each of these new co-author is added on top of the stack. Laboratories addresses are extracted from the articles and the user can select among this list to affiliate the current author to one or more laboratory. The user can also link the current author with a person already depicted if they know each other but did not take part in a common publication. Authorâ€™s interests are defined using the MESH terms of their papers. Once all information about the current author have been fulfilled he is removed from the stack and at the end, the user can save its FOAF file as a XML/RDF file. This process could have been generated automatically by a recursive program without human intervention if there was not ambiguity between authors names and if the addresses of the laboratories were always written the same way but this is unfortunately not the case: the same address of a laboratory can be written differently from one paper to another and there is no unique identifier that could be used to discriminate two homonymous authors within the XML description of papers. Moreover, authorsâ€™ name definition may vary from one journal to another as some journal may use the initial of an author while another may use the complete first name. Furthermore the NCBI E-Search utility just uses the family name and the initial so homonymous authors may be fetched with the same query. In consequence SciFOAF requests the user to confirm/infirm the relationships between authors and their publications.

How to use SciFOAF ?

Start your network description by a NCBI Pubmed ID (PMID). e.g. 15047801.

Select people you know and you want to include in your network

For each selected author, complete its personal information (mail, web page...) and select his affiliated laboratories

For each paper, check if the paper was written by the current author. When you're done with this author click on the 'NEXT' button at the bottom of the scrollable page

When your done (there is no more author on the stack), click on the 'Save ' button to export the RDF/XML describing the network

What is MyFOAFExplorer and how can I use it ?

MyFOAFExplorer was created after SciFOAF in order to browse a scientific network, its publications, its laboratories,... The program is an applet that takes as input a FOAF file. The applet can be downloaded and used for your laboratoy/social network to introduce your staff, your publications,...
The code does not contain a compliant RDF parser (such as JENA), but the way it parses the XML works fine for my test, so it could be used as a framework to write your own FOAF file.

Screenshots

Download

The project is hosted on sourceforge.net.

Building

ANT and JAVAC are required:

Buildfile: build.xml

compile-all:
    [javac] Compiling 2 source files

jar-all:
      [jar] Building jar: urbigene/urbigene.jar

all:

BUILD SUCCESSFUL
Total time: 17 seconds

Using SciFOAF and MyFOAFExplorer.

To run SciFOAF: java -cp urbigene.jar org.lindenb.scifoaf.SciFOAF

To run MyFOAFExplorer: java -cp urbigene.jar org.lindenb.foafexplorer.Main sample/foaf.xml

To run MyFOAFExplorer as an applet:

<applet archive="urbigene.jar" width="640" height="640" 
code="org.lindenb.foafexplorer.FOAFApplet">
<param name="foaf" value="foaf.xml">
</applet>

References

Bachrach,C.A. and Charen,T. (1978) Selection of MEDLINE contents, the development of its thesaurus, and the indexing process. Medical Informatics (London), 3,237-254.
Bohne-Lang,A., Lang,E. (2005) Do we need a Unique Scientist ID for publications in biomedicine? Biomed Digit Libr., Mar 22;2(1):1.
Newman,M.E. (2001) The structure of scientific collaboration networks. Proc. Natl. Acad. Sci. USA., 98,404-409.
Ding,L., Zhou L., Finin T. and Joshi A. (2005) How the Semantic Web is Being Used: An Analysis of FOAF Documents. Proceedings of the 38th International Conference on System Sciences.
Wheeler,D.L., Barrett,T., Benson,D.A., Bryant,S.H., Canese,K., Church,D.M., DiCuccio,M., Edgar,R., Federhen,S., Helmberg,W., Kenton,D.L., Khovayko,O., Lipman,D.J., Madden,T.L., Maglott,D.R., Ostell,J., Pontius,J.U., Pruitt,K.D., Schuler,G.D., Schriml,L.M., Sequeira,E., Sherry,S.T., Sirotkin,K., Starchenko,G., Suzek,T.O., Tatusov,R., Tatusova,A., Wagner,L., Yaschenko,E. (2005). Database resources of the National Center for Biotechnology Information. Nucleic Acids Res., 33, 39-45.