On the last Sunday, 25th Nov., Chew-san and I went to Matsumoto to present a poster session at the 135th conference of the Linguistic Society of Japan. This society is the largest one in the linguistic fields in Japan.
Our session concerned language identification and the status of the languages on the Asian and African web. Unfortunately our session was assigned 11:30-13:10, a time for lunch, there were only twenty visitors. But they were interested in our research and asked suggestive questions:
-our research could be applied to an automatic identification of spoken language?
-how our identification engine would be utilized in the engineering fields?
-how is the situation of India? Hindi is widely used or not? (by a linguist of Indic language)
-how is the diachronic transition of the use of languages? how it could be analyzed from the sociolinguistic viewpoint?
A linguist of Indic language (the same person above) said to us that in India, those who can acsess the internet are received the higher educations and are skilled in English, and they have a tendency to use English at the social communications. I guess that the same situation should be exist in the most of Asian and African countries, but the speculation could be supported by the fact.She also said that it was very interesting to her that Bhojopuri, regarded as a dialect of Hindi, appeared in our survey.
The summary of our presentation could be referred to from the following address:
Mohd Zaidi Abd Rozan. “Trip Report: Barcelona, Spain & Setúbal, Portugal 2006 April 10th-13th" for the Japan Science and Technology (JST) Agency, JAPAN. (PDF-Report
Mohd Zaidi Abd Rozan, Yoshiki Mikami, "Knowledge Details in Web Forums: How High or Low above the Ground?", 19th International Federation of Information Processing (IFIP) World Computer Congress 2006
, Santiago, Chile, August 20-25, 2006. (PDF-Paper)
Mohd Zaidi Abd Rozan, Yoshiki Mikami, "Web Forum Provides Beneficial Knowledge: Analysis of Details by Kipling's Framework", Knowledge Management International Conference & Exhibition 2006 (KMICE'06)
, Kuala Lumpur, Malaysia, June 6-8, 2006. (PDF-Paper)
Rizza Caminero, Zavarsky Pavol, Yoshiki Mikami, "Status of the African Web", The 15th International World Wide Web Conference (WWW2006)
, Edinburgh, Scotland, 23-26 May 2006. (PDF/HTML
Katsuko T. Nakahira, Tetsuya Hoshino, Yoshiki Mikami, "Geographical Locations of African Servers", The 15th International World Wide Web Conference (WWW2006)
, Edinburgh, Scotland, 23-26 May 2006. (PDF/HTML
Mohd Zaidi Abd Rozan, Yoshiki Mikami, "Bahasa Sembang in Web Forums: Knowledge Management for Piles of Atopian Discourse", International Conference on Web Information Systems and Technologies 2006 (WEBIST2006)
, Setubal, Portugal, April 11-13, 2006. (PDF-Paper)
Zavarsky Pavol, Wunna Ko Ko, Yew Choong Chew, Yoshiki Mikami, Tatsuo Kobayashi, "Unicode Spreading on the Web: A case of Asian & African Domains", 29th Internationalization and Unicode Conference
, San Francisco, USA, March 2006. (PDF-Slides
Yoshiki Mikami, Ahmad Zaki Abu Bakar, Virach Sornlertlamvanich, Om Vikas, Zavarsky Pavol, Mohd Zaidi Abd Rozan, Göndri Nagy János, Tomoe Takahashi, Language Diversity On The Internet: An Asian View, in "Measuring Linguistic Diversity on the Internet", edited with an introduction by UNESCO Institute for Statistics
, Montreal Canada, UNESCO. pp.91-103, 2005. (PDF-Book: English version
, PDF-Book: French version
Pavol Zavarsky, Yoshiki Mikami, "Structural properties of the web of the Organization of Islamic Conference and Israel", NECEC 2005 (organized by IEEE Newfoundland and Labrador Section)
, Nov. 8, 2005, St. John's, Newfoundland Canada. (PDF-Paper
Pavol Zavarsky, Yoshiki Mikami, Shota Wada, "Language and encoding scheme identification of extremely large sets of multilingual text documents", The 10th Machine Translation Summit, pp.354-355, Phuket, Thailand, sept. 12-16, 2005. (PDF-Paper
Wunna Ko Ko, Yoshiki Mikami, "Languages of Myanmar in Cyberspace", Nagaoka University of Technology Bulletin on Language Science and Humanity, Vol. 19. pp.249‐264 (2005). (PDF-Paper
William Wizcarra, Yoshiki Mikami, "Endangered Latin American Languages and their place in the Cyberspace", Nagaoka University of Technology Bulletin on Language Science and Humanity, Vol. 19. pp.241‐247 (2005). (PDF-Paper
Mohd Zaidi Abd Rozan, Yoshiki Mikami, Ahmad Zaki Abu Bakar, Om Vikas.
"Multilingual ICT Education: Language Observatory as a Monitoring Instrument"
In Proceedings of South East Asia Regional Computer Confederation 2005
(SEARCC2005) in CRPIT
. pp.53-61, 28-30 September 2005, Sydney, AUSTRALIA. ISBN: 1-920682-28-7 (PDF-Paper
Yoshiki Mikami, "MMT PROJECT 1987-1996", The 10th Machine Translation Summit (MT Summit X)
, September 12-16 2005, Phuket, Thailand. (PDF-Slides
| "LANGUAGE/MT/NLP TIMELINE in MMT Project Member Countries - draft version" in PDF
Mohd Zaidi Abd Rozan, Yoshiki Mikami: "Extending Our Sense of Cyberspace Language Plurality: The Value of the Language Observatory (LO) Project", In Proceedings of The 10th International Conference on Translation
. pp.507-516, 02-04 August 2005, Kota Kinabalu, Sabah, MALAYSIA. ISBN: 983-3376-55-X (PDF-Paper
Wunna Ko Ko, Yoshiki Mikami, "Languages of Myanmar in Cyberspace", In Proceedings TALN & RECITAL 2005
(NLP for Under-Resourced Languages Workshop). pp.269-278, 10 June 2005 in Dourdan, France. ISBN: 2-9524255-0-7 (PDF-Paper
Mohd Zaidi Abd Rozan. “Malaysia Trip Report: The Significance of LOP" : January 2005 for the Japan Science and Technology (JST) Agency, JAPAN. (PDF-Report
Yoshiki Mikami, Zavarsky Pavol, Mohd Zaidi Abd Rozan, Izumi Suzuki, Masayuki Takahashi, Tomohide Maki, Irwan Nizan Ayob, Massimo Santini, Paolo Boldi, Sebastiano Vigna,
"The Language Observatory Project", In Poster Proceedings of the Fourteenth International World Wide Web Conference (WWW2005)
. pp.990-991,10-14 May 2005, Chiba, JAPAN. ISBN: 1-59593-051-5 (PDF-Poster
Yoshiki Mikami, "ICT Measures for Policy Development", The 3rd Asian Forum for Information Technology (AFIT), October 6-7, 2004, Bangkok. (PDF-Slides
Tomoe Takahashi, Katsuko Nakahira & Yoshiki Mikami, "Language Observatories in the World", Nagaoka University of Technology Bulletin on Language Science and Humanity, Vol. 18. pp.179‐198（2004）. (PDF-Paper: Japanese Version)
Yoshiki Mikami & Venkataraman Narayanan, "Information Technology Localization: The Past, Present and Future Agenda", SCALLA 2004
*, Kathmandu, Nepal, January 5-7, 2004.
SCALLA*: Sharing Capability in Localization and Human Language Technologies. (PDF-Paper
Izumi Suzuki, Yoshiki Mikami, Ario Ohsato, Yoshihide Chubachi, "A Language and Character Set Determination Method Based on N-gram Statistics", In ACM Transactions on Asian Language Information Processing, Vol.1, No.3, September 2002. pp.270-279. (PDF-Paper
Yoshiki Mikami,"Global digital-divide among scripts", VishwaBharat@tdil [विश्वभारत
], October 2002, p.1, New Delhi, INDIA. (PDF
Yoshiki Mikami,"Web Page Distribution by Language and by Domain: East/South Asia -- An Estimate Searched by Google --", February 5, 2002. (HTM