SUMMARY: Session P14-GW

 

Title Next Generation Language Resources using Grid
Authors F. Calzolari, E. Sassolini, M. Sassi, S. Cucurullo, E. Picchi, F. Bertagna, A. Enea, M. Monachini, C. Soria, N. Calzolari
Abstract This paper presents a case study on the challenges and requirements posed by next-generation language resources, realized as an overall model of an open, distributed and collaborative language infrastructure. If a “new paradigm” for language resource sharing is required, we believe that the emerging and still evolving technology of Grid computing is well suited to a concrete realization of this vision. Given the current limitations of Grid computing, it is important to test the new environment on basic language analysis tools in order to gauge its potential and possible limitations for NLP. For this reason, we ran experiments on one module of the Linguistic Miner, namely the extraction of linguistic patterns from restricted-domain corpora. The Grid environment produced the expected results (reduced processing time, huge storage capacity, data redundancy) at no additional cost to the final user.
Keywords grid, acquisition, topic classification