15.450 new on WWW: ELRA resources

From: Humanist Discussion Group (by way of Willard McCarty <w.mccarty@btinternet.com>)
Date: Sat Jan 12 2002 - 01:44:19 EST

                   Humanist Discussion Group, Vol. 15, No. 450.
           Centre for Computing in the Humanities, King's College London
                   <http://www.princeton.edu/~mccarty/humanist/>
                  <http://www.kcl.ac.uk/humanities/cch/humanist/>

             Date: Sat, 12 Jan 2002 06:36:29 +0000
             From: Magali Duclaux <duclaux@elda.fr>
             Subject: ELRA News

    ************************************************************
    ELRA - European Language Resources Association
    ************************************************************

    We are pleased to announce two new resources now
    available in our catalogue of language resources:

    ELRA W0030 Arabic Data Set
    ELRA W0031 GeFRePaC - German French Reciprocal
    Parallel Corpus

    A short description of these two new resources is given
    below.
    Please visit the online catalogue to get further details:
    http://www.elda.fr/catalog.html

    ELRA W0030 Arabic Data Set:
    The corpus contains Al-Hayat newspaper articles, enriched
    for the development of Language Engineering and Information
    Retrieval applications. The data is organised into 7
    subject-specific databases according to the Al-Hayat
    subject tags. Mark-up, numbers, special characters and
    punctuation have been removed. The total file size is
    268 MB. The dataset contains 18,639,264 distinct tokens
    in 42,591 articles across the 7 domains.

    ELRA W0031 GeFRePaC - German French Reciprocal
    Parallel Corpus:
    GeFRePaC was produced in the framework of the LRsP&P
    project. It contains 30 million words: 15 million in
    German and 15 million in French.
    It covers general natural language as used in
    public socio-political discourse, with a focus on
    multilingual administration and on commercial and legal
    documentation. It was created for the purpose of
    developing and improving translation aids.

    =====================================
    For further information, please contact:

    ELRA/ELDA
    55-57 rue Brillat-Savarin
    F-75013 Paris, France

    Tel: +33 01 43 13 33 33
    Fax: +33 01 43 13 33 30

    E-mail: mapelli@elda.fr

    or visit our Web site:
    http://www.icp.grenet.fr/ELRA/home.html
    or http://www.elda.fr
    =====================================


