Priya Nandakumar
About me
Latest Articles
hreshold Calibration Techniques for Semantic Similarity Classifiers
Reading Time: 10 minutesSemantic similarity classifiers are used to identify meaning-level overlap between texts, even when the wording is different. They can help detect paraphrased content, near-duplicate articles, repeated intent, source dependence, or suspicious similarity that exact-match systems may miss. However, the classifier’s score is only useful when it is translated into a practical decision. A semantic similarity […]
Approximate Nearest Neighbor Index Design for Plagiarism Search at Scale
Reading Time: 8 minutesPlagiarism search becomes much harder when a platform moves from checking a few documents to comparing millions of submissions, web pages, institutional files, and academic sources. A small system can rely on exact matching, n-grams, shingles, or direct database lookups. At scale, however, exhaustive comparison quickly becomes too slow, too expensive, and too difficult to […]
Chunking Strategies for High-Recall Document Similarity Retrieval
Reading Time: 8 minutesIn document similarity retrieval, teams often focus on embeddings, vector search, rerankers, and evaluation metrics. Those components are important, but many systems lose quality much earlier, at the point where documents are divided into retrievable units. That step, known as chunking, has a direct effect on recall. It determines what the retriever can compare, how […]
Vector Embedding Techniques for Plagiarism Detection
Reading Time: 4 minutesAs digital content proliferates across the internet and academic repositories, plagiarism detection has become a critical concern for educational institutions, publishers, and organizations. Traditional plagiarism detection methods, which rely on exact string matching or simple pattern recognition, often fail to capture semantic similarities or sophisticated paraphrasing. To overcome these limitations, vector embedding techniques have emerged […]
Hardware Optimization Techniques for Neural Network Inference
Reading Time: 4 minutesEngineering domains has led to an increasing demand for efficient neural network inference. While training deep learning models often occurs in high-performance computing environments, inference typically needs to be executed in real-time and resource-constrained settings such as embedded systems, edge devices, and industrial hardware. This shift has made hardware optimization a critical aspect of deploying […]
Transfer Learning Techniques for Engineering Applications
Reading Time: 5 minutesArtificial intelligence and machine learning has significantly transformed modern engineering practices. However, one of the persistent challenges in deploying machine learning models in engineering domains is the lack of large, labeled datasets required for effective training. Transfer learning has emerged as a powerful solution to this problem, enabling models trained on one task or domain […]
Advanced Scheduling Algorithms for Real-Time Systems
Reading Time: 4 minutesReal-time systems are ubiquitous in modern technology, powering applications ranging from autonomous vehicles and industrial automation to aerospace control and medical devices. These systems must perform tasks within strict timing constraints, making efficient and reliable scheduling a cornerstone of their design. Advanced scheduling algorithms for real-time systems are therefore essential to ensure that tasks execute […]
Efficient Memory Management Techniques for Embedded Platforms
Reading Time: 4 minutesEmbedded platforms are the backbone of modern electronic devices, powering everything from smartphones and wearable technology to industrial controllers and automotive systems. Unlike general-purpose computing systems, embedded platforms operate under strict resource constraints, including limited memory, processing power, and energy budgets. Efficient memory management techniques for embedded platforms are therefore critical to ensuring optimal performance, […]
GPU-Accelerated AI Pipelines for Real-Time Academic Plagiarism Detection
Reading Time: 4 minutesGPU-accelerated plagiarism detection is rapidly transforming how universities, research institutions, and academic publishers verify the originality of scholarly documents. As academic databases expand to millions of research papers, theses, and technical reports, traditional CPU-based plagiarism detection systems face increasing computational limitations. Real-time plagiarism detection requires the ability to compare newly submitted texts against massive repositories […]
Efficient Transformer Architectures for High-Precision Large-Scale Academic Text Analysis
Reading Time: 4 minutesThe growth of scholarly publications worldwide has created unprecedented challenges for analyzing academic texts at scale. Traditional natural language processing methods are increasingly insufficient for processing millions of documents efficiently, particularly when semantic accuracy is critical. Transformer-based models such as BERT, GPT, and their derivatives have revolutionized academic text analysis by capturing context, semantics, and […]
Vector Embedding Optimization for High-Precision Document Similarity Search
Reading Time: 4 minutesAccurate document similarity search is fundamental for plagiarism detection, semantic analysis, and large-scale academic content evaluation. Traditional text-matching algorithms have gradually been supplemented by vector embedding techniques, which encode textual information into high-dimensional numerical representations that capture semantic and contextual relationships. Despite significant advances, the precision and efficiency of document similarity searches depend heavily on […]
Adversarial Attacks on Plagiarism Detection Systems and Robust Countermeasures
Reading Time: 3 minutesNew threats are emerging in the form of adversarial attacks, when academic integrity becomes increasingly reliant on automated plagiarism detection systems. These attacks involve deliberately modifying text, code, or other research outputs to evade detection, while retaining the underlying content. With the proliferation of AI-based paraphrasing tools, machine translation, and text generation models, adversarial techniques […]
Real-Time Plagiarism Detection in Distributed Cloud-Based Educational Systems
Reading Time: 4 minutesCloud-based educational platforms has transformed modern learning environments, enabling students to access materials, submit assignments, and collaborate online from virtually anywhere. While these distributed systems enhance accessibility and scalability, they also create new challenges for maintaining academic integrity. Plagiarism, both intentional and unintentional, remains a significant concern as students increasingly rely on online resources. Traditional […]
Graph-Based Code Similarity Analysis for Large-Scale Software Plagiarism Detection
Reading Time: 4 minutesComputer science education, open-source collaboration, and distributed software development has significantly increased the volume of publicly available code. While this growth supports innovation and knowledge sharing, it also intensifies the challenge of detecting software plagiarism. In academic environments, students may modify copied programs to evade detection, while in industry proprietary algorithms may be reused without […]
Self-Healing Networks: AI Approaches for Fault Detection and Recovery
Reading Time: 3 minutesModern networks are becoming increasingly complex, dynamic, and critical to both business and infrastructure operations. Traditional network management approaches often struggle to maintain reliability in the face of faults, congestion, or cyber-attacks. Self-healing networks, powered by artificial intelligence, aim to detect, diagnose, and automatically recover from failures in real-time. This article explores the principles behind […]
Low-Latency Data Transmission in Wireless Ad Hoc Networks
Reading Time: 4 minutesWireless ad hoc networks represent a flexible communication paradigm in which nodes cooperate dynamically without relying on fixed infrastructure. This decentralized architecture makes ad hoc networks particularly suitable for scenarios such as emergency response, battlefield communications, vehicular systems, and temporary sensor deployments. In many of these applications, real-time data exchange is critical, placing stringent requirements […]
Detecting Mosaic Plagiarism with Advanced Text Mining
Reading Time: 4 minutesPlagiarism continues to be one of the most persistent challenges in academic and scientific writing. Although modern plagiarism detection systems are effective at identifying direct copying, more sophisticated forms of plagiarism remain difficult to detect. Mosaic plagiarism, often referred to as patchwork plagiarism, involves rephrasing and recombining fragments from multiple sources into a new text […]
Machine Learning Approaches for Network Traffic Classification
Reading Time: 4 minutesNetwork traffic classification is a critical aspect of modern computer networks, enabling administrators to monitor, manage, and secure data flows across complex infrastructures. Traditional methods based on port numbers, protocol signatures, or rule-based filtering are increasingly insufficient due to the rapid growth of encrypted traffic, dynamic applications, and heterogeneous devices. In response, machine learning (ML) […]
P2P Attribute-Based Communication in Mobile Ad Hoc Networks over Wi-Fi
Reading Time: 3 minutesMobile communication has evolved far beyond traditional infrastructure-based networks. As smartphones and wireless devices become more powerful, the demand for decentralized, flexible, and infrastructure-independent communication continues to grow. Mobile Ad Hoc Networks, commonly referred to as MANETs, address this need by allowing devices to form temporary networks without relying on centralized access points or servers. […]
Enhancing Facial Feature Extraction: Technical Insights into DOG-Based Granulation for High-Precision Face Recognition
Reading Time: 3 minutesFace recognition systems have become increasingly vital in applications ranging from security authentication to personalized user experiences. Central to the success of these systems is the accuracy and robustness of facial feature extraction. The Difference of Gaussian (DOG) method, a cornerstone in image processing, offers a compelling approach to enhancing feature granularity for high-precision face […]