Efficient topical focused crawling through neighborhood feature

T Suebchua, B Manaskasemsak… - New Generation …, 2018 - Springer
A focused web crawler is an essential tool for gathering domain-specific data used by
national web corpora, vertical search engines, and so on, since it is more efficient than …

Lietuvių kalbos terminų automatinis atpažinimas ir apibrėžimas [Automatic recognition and definition of Lithuanian terms]

A Bielinskiene, L Boizou, G Grigonyté, J Kovalevskaite… - 2015 - diva-portal.org
As computational and corpus linguistics research grows in Lithuania and the variety of language
resources increases, more favorable conditions are emerging for building more sophisticated natural language analysis …

A framework of concepts in the migration domain and their expression in English and Lithuanian

O Usinskiene - 2024 - vb.mruni.eu
The topic of migration has become increasingly significant for contemporary
discussion due to its sharp rise. People migrate for a variety of reasons, such as seeking …

Švietimo ir mokslo terminų žodynas ir ontologija [Dictionary and ontology of education and science terms]

I Markievicz, E Rimkutė - Terminologija/Terminology, 2013 - ceeol.com
INTRODUCTION. In 2010–2012, the Centre of Computational Linguistics at Vytautas Magnus University
carried out a project funded by the Research Council of Lithuania, Švietimo ir mokslo terminų …

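Alignement lexical en corpus comparables: le cas des composés savants et des adjectifs relationnels [Lexical alignment in comparable corpora: the case of neoclassical compounds and relational adjectives]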

R Harastani - 2014 - theses.hal.science
Our work concerns the automatic extraction of a list of terms aligned with their
translations (that is, a specialized bilingual lexicon) from a comparable corpus …

Building of parallel and comparable cybersecurity corpora for bilingual terminology extraction

A Utka, L Mockienė, M Laurinaitis… - Selected Papers from …, 2022 - cris.mruni.eu
The paper aims at presenting English-Lithuanian corpora for bilingual term extraction (BiTE)
in the cybersecurity domain within the framework of the project DVITAS. It is argued that a …

Parallel and comparable corpora for terminology analysis in the domain of migration

O Ušinskienė, S Rackevičienė - Language for …, 2023 - apgads.lu.lv
The aim of the paper is to present the bilingual (English–Lithuanian) corpora compiled for
research on specialised language in the domain of migration. The topic of migration is found …

Corpora for Bilingual Terminology Extraction in Cybersecurity Domain

A Utka, L Mockienė, M Laurinaitis… - … of CLARIN Annual …, 2021 - cris.mruni.eu
The paper aims at presenting English-Lithuanian corpora for bilingual term extraction (BiTE)
in the cybersecurity domain within the framework of the project DVITAS. It is argued that a …

A Study on Efficient Topical Focused Website Segment Crawler

T Suebchua - 2018 - core.ac.uk
Topic-specific web pages are essential data for vertical search engines and natural
language processing (NLP) researches. To acquire these web pages, many researchers …

A Study on Efficient Topical Focused Website Segment Crawler

スブチュアタナポール - 2018 - waseda.repo.nii.ac.jp
Topic-specific web pages are essential data for vertical search engines and natural
language processing (NLP) researches. To acquire these web pages, many researchers …