Please use this identifier to cite or link to this record:
https://hdl.handle.net/1822/16472
Full record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Simões, Alberto | - |
dc.contributor.author | Almeida, J. J. | - |
dc.date.accessioned | 2012-01-18T16:04:04Z | - |
dc.date.available | 2012-01-18T16:04:04Z | - |
dc.date.issued | 2009 | - |
dc.identifier.isbn | 978-989-96278-1-9 | - |
dc.identifier.uri | https://hdl.handle.net/1822/16472 | - |
dc.description.abstract | The Marker Hypothesis was first defined by Thomas Green in 1979. It is a psycholinguistic hypothesis stating that every language has a set of words that mark the boundaries of phrases in a sentence. While it remains a hypothesis because nobody has proved it, tests have shown that its results are comparable to those of basic shallow parsers, with higher efficiency. The chunking algorithm based on the Marker Hypothesis is simple, fast and almost language independent. It depends on a list of closed-class words, which is already available for most languages. This makes it suitable for bilingual chunking (there is no need for a separate shallow parser for each language). This paper discusses the use of the Marker Hypothesis combined with Probabilistic Translation Dictionaries for extracting example-based machine translation resources from parallel corpora. | por |
dc.language.iso | eng | por |
dc.publisher | Designeed | por |
dc.rights | openAccess | por |
dc.subject | Parallel corpora | por |
dc.subject | Text segmentation | por |
dc.title | Bilingual example segmentation based on markers hypothesis | por |
dc.type | conferencePaper | - |
dc.peerreviewed | yes | por |
sdum.publicationstatus | published | por |
oaire.citationStartPage | 95 | por |
oaire.citationEndPage | 98 | por |
oaire.citationTitle | I Iberian SLTech 2009 : Proceedings of the I Joint SIG-IL/Microsoft Workshop on Speech and Language Technologies for Iberian Languages | por |
sdum.conferencePublication | I Iberian SLTech 2009 : Proceedings of the I Joint SIG-IL/Microsoft Workshop on Speech and Language Technologies for Iberian Languages | - |
Appears in collections: | DI/CCTC - Artigos (papers) |
Files in this record:
File | Description | Size | Format | |
---|---|---|---|---|
markers09.pdf | Main document | 123.3 kB | Adobe PDF | View/Open |
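
The marker-based chunking summarized in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `marker_chunks`, the tiny marker lists `EN_MARKERS` and `PT_MARKERS`, the whitespace tokenization, and the rule that a chunk needs at least one non-marker word before it can be closed are all assumptions made for illustration.

```python
# Hypothetical, deliberately tiny marker lists (closed-class words).
# A real list would cover the determiners, prepositions, conjunctions
# and pronouns of each language.
EN_MARKERS = {"the", "a", "an", "of", "in", "on", "to", "with", "and", "or"}
PT_MARKERS = {"o", "a", "os", "as", "um", "uma", "de", "em", "no", "na", "com", "e", "ou"}


def marker_chunks(sentence, markers):
    """Split a sentence into chunks, starting a new chunk at each marker word.

    Assumption: a chunk is only closed once it holds at least one
    non-marker word, so runs of consecutive markers stay together.
    Tokenization is plain whitespace splitting (punctuation is not handled).
    """
    chunks = []
    current = []
    has_content = False  # True once the current chunk holds a non-marker word
    for token in sentence.split():
        is_marker = token.lower() in markers
        if is_marker and has_content:
            # A marker after content words opens a new chunk.
            chunks.append(" ".join(current))
            current = []
            has_content = False
        current.append(token)
        if not is_marker:
            has_content = True
    if current:
        chunks.append(" ".join(current))
    return chunks


print(marker_chunks("the cat sat on the mat with a hat", EN_MARKERS))
# ['the cat sat', 'on the mat', 'with a hat']
print(marker_chunks("o gato dormiu no tapete com um chapéu", PT_MARKERS))
# ['o gato dormiu', 'no tapete', 'com um chapéu']
```

The same function handles both languages because only the marker list changes, which is the property that makes the approach attractive for bilingual chunking. Aligning the resulting chunks across languages, for example with a probabilistic translation dictionary, is a separate step that this sketch does not cover.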