Gujarati (ગુજરાતી) spell checker using Norvig algorithm with grammatical rules ofRhasva, Dīrgha and Anusvāra
DOI:
https://doi.org/10.15649/2346030X.4446Keywords:
rule-based, gujarati, machine learning (ml), grammatical rule rhasva, dīrgha, anusvāra, natural language processing (nlp), gnlp, artificial intelligence (ia)Abstract
This paper focused on Gujarati Spell Checker Using Norvig Algorithm with Grammatical Rules of Rhasva, Dīrgha, Anusvāra. Researchers make a note of Gujarati language history with a brief explanation of consonants and vowels. A total of 5, 5 and 14 rules are available respectively for Anusvāra, Rhasva, and Dīrgha. Many Indian languages use the term Anusvara to denote a nasal sound with a specific symbol. A single mātrā Small E-i (હ્રસ્વ ઇ) / U-u (હ્રસ્વ ઉ – ુુ) is called Rhasva in gujarati literature while double mātrā Long E-e (દીર્ઘ ઈ – ુી) / Long U-oo (દીર્ઘ ઊ- ુ ) known as Dīrgha respectively. This manuscript constitutes an innovative effort in the domain of Gujarati Natural Language Processing (GNLP) investigation, presenting a comprehensive repository of grammatical rules for the Gujarati language, inclusive of examples in both English and Gujarati. The proposed system will use given grammatical rules. Researcher uses two version of dataset for Gujarati Spell Checker with Norvig Algorithm in replace and insert function. One has whole barakhdi and second one focuses on Rhasva, Dīrgha, Matras, and Anusvāra in algorithm process. Output of this paper may explain that the deep learning approach is more appropriate compare to other approach.
References
[1] K. Vyas “Vikas, Swarup, and Unviversity Granth Nirman Board”, ISBN: 978-93-81265-98-7.
[2] B. Thakur, M. Upreti, P. Sahagal, H. Joshi, S. Hardikar, V. Ratalani, B.B. Jyoti “Gujarati: A Textbook for Learning Gujarati Through Hindi From the Central Institute of Indian Languages”.
[3] W. S. Tisdall, “A Simplified Grammar of the Gujarati Language: Together With A Short Reading Book and Vocabulary”. London: Kegan Paul, Trench, Trúbner, 1892.
[4] P. J. Mistry, “Gujarati writing,” in The World’s Writing Systems, B. Daniels, Ed. Oxford University Press, 1996.
[5] G. Cardona “A Gujarati Reference Grammar”. University of Pennsylvania Press, 1965.
[6] N. Patel and D. Patel, “‘Implementation Approach of Indian Language Gujarati Grammar’s Concept “sandhi’ using the Concepts of Rule-based NLP,” 8th International Conference on Computing for Sustainable Global Development (INDIA Com), New Delhi, India, vol. 2021, pp. 481–485. 2021.
[7] Gujarati. Available at: languagesgulper.com.
[8] W. S. Tisdall, “A simplified grammar of the Gujarati language,” vol. 22, Рипол Классик, 1892.
[9] G. Cardona and B. Suthar, “The Indo-Aryan languages,” in Gujarati, G. Cardona, Jain, and Dhanesh, Eds. Routledge, ISBN 978-0-415-77294-5, 2003.
[10] B. Suthar, ‘A Brief Outline of Gujarati Parts-of-Speech (POS)’, A Nirman Foundation Project. Philadelphia: Department of South Asia Studies, University of Pennsylvania, 2003.
[11] R. Soni and G. Lekhan-Paddhati (Gujarati), Publisher: Gurjar, ISBN-: 9351753409.
[12] R. B. K. P. Trivedî, Higher Grammar of the Gujarati Language. Macmillan and Company, 1919.
[13] Y. Vyas and G. B. Vyakran (Gujarati), By Publisher: Balvinod Prakashan, ISBN: 978-93-8478-006-7.
[14] What Is a Rule-Based System? Available at: j-paine.org.
[15] S. V. Maniya, Ms. J. Sheth, and Dr. K. Lad, “Compression Technique based on Dictionary approach for Gujarati Text,” Int. J. Eng. Res. Dev. eISSN : 2278-067X, pISSN : 2278-800X. Available at: http://www.ijerd.com, vol. 4, no. 8, pp. 101–108, Nov. 2012.
[16] A. Abraham “Intelligent Systems: A Modern Approach. Springer Science+Business Media,” pp 149 2011. ISBN 978-3-642-21004-4.
[17] H. Liu, A. Gegov, and F. Stahl, “Categorization and construction of rule based systems,” in Commun. Comput. Inf. Sci.. EANN vol. 459, 2014.
[18] B. G. Buchanan and R. O. Duda, “Principles of rule-based expert systems,” in Adv. Comput. Elsevier, vol. 22, 1983. doi: 10.1016/S0065-2458(08)60129-1.
[19] J. V. Julio and C. Pérez, “24th International Florida Artificial Intelligence Research Society”, FLAIRS - 24. Sara & González-Cristóbal: Sonia & Serrano, 2011.
[20] A. J. Szanser, “Automatic error-correction in natural languages,” Inf. Storage Retrieval, vol. 5, no. 4, pp. 169–174, 1970. doi: 10.1016/0020-0271(70)90045-8.
[21] P. Kumar, A. Kannan, and N. Goel, “Design and implementation of NLP-based spell checker for the Tamil language,” in Proc. 1st International Electronic Conference on Applied Sciences. Basel, Switzerland: MDPI, Nov. 10–30, 2020.
[22] C. Patil, R. Rodrigues, and R. Ron, “Auto-spelling checker using natural language processing,nternational research,” J. Eng. Technol. (IRJET) e-ISSN: 2395-0056, vol. 07, pp-794–796, Aug 2020.
[23] A. N. M. Fahim Faisal, M. A. Rahman, and T. Farah, “A rule-based Bengali grammar checker,” Fifth World Conference on Smart Trends in Systems Security and Sustainability (WorldS4), London, United Kingdom, vol. 2021, pp. 113–117, 2021.
[24] Y. Gondaliya, P. Kalariya, B. Y. Panchal, and A. Nayak, “A rule-based grammar and spell checking,” SAMRIDDHI A J. Phys. Sci. Eng. Technol., vol. 14, no. 1, pp. 48–54, 2022.
[25] A. Istiak & J. maliha, T. Zarin, T. Reza, S. M. Salim, Hossain, and Dilshad, “Spell Corrector for Bangla Language Using Norvig’s Algorithm and Jaro–Winkler Distance” Bulletin of Electrical Engineering and Informatics. vol. 10. pp. 1997-2005, 2021.
[26] H. Patel, B. Patel, and K. Lad, “Jodani: A spell checking and suggesting tool for Gujarati language,” Data Sci. Eng. (Confluence), (Noida, India) 11th International Conference on Cloud Computing, vol. 2021, pp. 94–99, 2021.
[27] A. Bhansali, A. Chandravadiya, B. Y. Panchal, M. H. Bohara, and A. Ganatra, “Language identification using combination of machine learning algorithms and vectorization techniques,” 2nd International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE), Greater Noida, India, vol. 2022, pp. 1329–1334, 2022.
[28] B. Y. Panchal and A. Shah, “Spell checker using Norvig algorithm for Gujarati language,” in. ICSMDI 2024. Algorithms for Intelligent Systems, R. Asokan, D. P. Ruiz, and S. Piramuthu, Eds. Smart Data Intelligence. Singapore: Springer, 2024.
[29] J. Baxi and B. Bhatt, “A bidirectional LSTM-based morphological analyzer for Gujarati,” Nat. lang. processing. Cambridge University Press, pp. 1–17, 2024.
[30] N. G. Patel and D. B. Patel, “NLP-based processing of Gujarati compound word sandhi’s generation and segmentation,” in International Conference on Universal Threats in Expert Applications and Solutions. Singapore: Springer Nature Singapore, pp. 263–271, Jan., 2024.
Downloads
Published
How to Cite
Downloads
Issue
Section
License
Copyright (c) 2025 AiBi Journal of Research, Administration and Engineering

This work is licensed under a Creative Commons Attribution 4.0 International License.
The journal offers open access under a Creative Commons Attibution License

This work is under license Creative Commons Attribution (CC BY 4.0).







