<HTML>
<HEAD>
<TITLE>LREC 2000 - Paper 254 summary</title>
<SCRIPT LANGUAGE="JavaScript" TYPE="text/javascript">
<!--
// preload images:
 if(document.images)
  {
  hom_d= new Image(100,20);   hom_d.src="../eikones/hom_d.gif";
  pap_g=new Image(100,20);    pap_g.src="../eikones/pap_g.gif";  
  pap_d=new Image(100,20);    pap_d.src="../eikones/pap_d.gif";  
  pap_l=new Image(100,20);    pap_l.src="../eikones/pap_l.gif";  
  hom_l=new Image(100,20);    hom_l.src="../eikones/hom_l.gif";
  aut_d=new Image(100,20);     aut_d.src="../eikones/aut_d.gif";
  aut_l=new Image(100,20);    aut_l.src="../eikones/aut_l.gif";
  Key_d=new Image(100,20);    Key_d.src="../eikones/Key_d.gif";
  Key_l=new Image(100,20);   Key_l.src="../eikones/Key_l.gif";
  ses_d=new Image(100,20);    ses_d.src="../eikones/ses_d.gif";
  ses_l=new Image(100,20);   ses_l.src="../eikones/ses_l.gif";
  abs_l=new Image(100,20);   abs_l.src="../eikones/abs_l.gif";
  abs_d=new Image(100,20);    abs_d.src="../eikones/abs_d.gif";
  aut_l=new Image(100,20);    aut_l.src="../eikones/aut_l.gif";
}

function changimg(imgName,imgObjName)
 {
  if (document.images)
   {
   document.images[imgName].src=eval(imgObjName+".src");
   }
 }
//-->
</SCRIPT>

</HEAD>
<BODY marginwidth="0" marginheight="0" leftmargin="0" topmargin="0" rightmargin="0"  background="../eikones/fonto.jpg">
<TABLE align="center" border="0" width="100%" cellspacing="0" cellpadding="0" >
<TR>
<TD height="50" valign="center" colspan="7" bgcolor="#003163"><font face="Arial" size="4" color="#ffffff"><b>LREC 2000</b> 2<sup>nd</sup>
      International Conference on Language Resources &amp; Evaluation</font></TD>
</TR>
 <tr bgcolor="#003162">
 <td width="100" valign="center"><A href="../../default.htm" onmouseout="changimg('home','hom_d')" onmouseover="changimg('home','hom_l')"><IMG border="0" height="20" name="home" src="../eikones/hom_d.gif" width="100"></A></td>
 <TD width="100"><A href="../session.htm" onmouseout="changimg('sessions','ses_d')" onmouseover="changimg('sessions','ses_l')"><IMG border="0" height="20" name="sessions" src="../eikones/ses_d.gif" width="100"></A></TD>
 <TD width="100"><A href="../paper.htm" onmouseout="changimg('papers','pap_d')" onmouseover="changimg('papers','pap_l')"><IMG border="0" height="20" name="papers" src="../eikones/pap_d.gif" width="100"></a></TD>
 <TD width="100"><A href="../abstract.htm" onmouseout="changimg('abstracts','abs_d')" onmouseover="changimg('abstracts','abs_l')"><IMG border="0" height="20"  name="abstracts" src="../eikones/abs_d.gif" width="100"></A></TD>
 <TD width="100"><A href="../author.htm" onmouseout="changimg('authors','aut_d')" onmouseover="changimg('authors','aut_l')"><IMG border="0" height="20"  name="authors" src="../eikones/aut_d.gif" width="100"></a></TD>
 <TD width="100"><A href="../keyword.htm" onmouseout="changimg('keywords','Key_d')" onmouseover="changimg('keywords','Key_l')"><IMG border="0" height="20" name="keywords" src="../eikones/Key_d.gif" width="100"></A></TD>
<td width="1000">&nbsp;</td>
 </tr>
 </TABLE>
<BLOCKQUOTE style="MARGIN-RIGHT: 0px">
  <P><A href="253.htm">Previous Paper</A>&nbsp;&nbsp; <A href="256.htm">Next Paper</A></P></BLOCKQUOTE>
  <center>
<TABLE width="95%" Align="center" Border="1" bordercolor="#669999" cellspacing="1">
    <tr>
      <td width="15%" height="40"><b>Title</b></font></td>
      <td width="85%" height="40"><font color="#990033" size="4">TyPTex: Inductive Typological Text Classification by Multivariate Statistical Analysis for NLP Systems Tuning/Evaluation</font></td>
    </tr>
    <tr>
      <td height="40"><b>Authors</b></td>
      <td height="40"><font color="#006600">Folch Helka</font> (UMR8503 : Analyses de corpus linguistiques, usages et traitements, CNRS / ENS Fontenay/Saint-Cloud, 92211 Saint-Cloud, France, folch@ens-fcl.fr)<br><font color="#006600">Heiden Serge</font> (UMR8503 : Analyses de corpus linguistiques, usages et traitements, CNRS / ENS Fontenay/Saint-Cloud, 92211 Saint-Cloud, France, slh@ens-fcl.fr)<br><font color="#006600">Habert Benoît</font> (UMR8503 : Analyses de corpus linguistiques, usages et traitements, CNRS / ENS Fontenay/Saint-Cloud, 92211 Saint-Cloud, Francer, habert@limsi.fr)<br><font color="#006600">Fleury Serge</font> (UMR8503 : Analyses de corpus linguistiques, usages et traitements, CNRS / ENS Fontenay/Saint-Cloud, 92211 Saint-Cloud, France, fleury@ens-fcl.fr)<br><font color="#006600">Illouz Gabriel</font> (LIMSI, CNRS / Université Paris Sud, Orsay, France, illouz@limsi.fr)<br><font color="#006600">Lafon Pierre</font> (UMR8503 : Analyses de corpus linguistiques, usages et traitements, CNRS / ENS Fontenay/Saint-Cloud, 92211 Saint-Cloud, France, lafon@ens-fcl.fr)<br><font color="#006600">Nioche Julien</font> (UMR8503 : Analyses de corpus linguistiques, usages et traitements, CNRS / ENS Fontenay/Saint-Cloud, 92211 Saint-Cloud, France, nioche@ens-fcl.fr)<br><font color="#006600">Prévost Sophie</font> (UMR8503 : Analyses de corpus linguistiques, usages et traitements, CNRS / ENS Fontenay/Saint-Cloud, 92211 Saint-Cloud, France, prevost@ens-fcl.fr)</td>
    </tr>
    <tr>
      <td height="40"><b>Keywords</b></td>
      <td height="40">Corpus Based Linguistics, Evaluation, Multidimensional Statistics, Natural Language Processing, Software Architectur, Tagging, Text Typology</td>
    </tr>
      <tr>
      <td height="40"><b>Session</b></td>
      <td height="40">Session WO3 - Corpus Categorisation</td>
    </tr>
     <tr>
      <td height="40"><b>Full Paper</b></td>
            <td height="40"><a href="../../ps/254.ps" target="newps" type="application/postscript">254.ps</a>, <a href="../../pdf/254.pdf" target="newpdf" type="application/pdf">254.pdf</a></td>
    </tr>
      <tr>
      <td height="40"><b>Abstract</b></td>
             <td height="40">The increasing use of methods in natural language processing (NLP) which are based on huge corpora require that the lexical, morpho-syntactic and syntactic homogeneity of texts be mastered. We have developed a methodology and associate tools for text calibration or ''profiling'' within the ELRA benchmark called ''Contribution to the construction of contemporary french corpora'' based on multivariate analysis of linguistic features. We have integrated these tools within a modular architecture based on a generic model allowing us on the one hand flexible annotation of the corpus with the output of NLP and statistical tools and on the other hand retracing the results of these tools through the annotation layers back to the primary textual data. This allows us to justify our interpretations.</td>
    </tr>
  </table><br>
  </center>
</BODY>
</html>