Marcus Ellery
About me
Latest Articles
Reproducibility, Version Control, and Integrity Risks in Research Software Workflows
Reading Time: 8 minutesResearch software problems are often described as reproducibility problems first. A result cannot be rerun, a notebook no longer executes, a dependency changed, or a released figure cannot be matched to the code a paper claims to use. That framing is useful, but it is incomplete. In software-based research, broken reproducibility is sometimes only the […]
Automated Document Screening Systems for Journal Submissions
Reading Time: 4 minutesAcademic publishing has created significant challenges for journal editors and reviewers. With increasing submission volumes across disciplines, maintaining high standards of quality, originality, and compliance has become more complex and time-consuming. Traditional manual screening processes are often insufficient to handle this scale efficiently. As a result, automated document screening systems have emerged as essential tools […]
Microservices-Based Architecture for Plagiarism Detection Platforms
Reading Time: 4 minutesThe increasing demand for academic integrity and content originality has driven the rapid evolution of plagiarism detection platforms. These systems must process large volumes of text, perform complex similarity analyses, and deliver results in real time to users across educational, corporate, and publishing environments. Traditional monolithic architectures often struggle to meet these requirements due to […]
Cross-Disciplinary Analysis of Plagiarism Patterns in Scientific Publications
Reading Time: 4 minutesScientific research is fundamental to the advancement of knowledge and innovation. However, the increasing pressure to publish, combined with the rapid expansion of digital information, has led to a rise in plagiarism across academic disciplines. Plagiarism in scientific publications is no longer confined to isolated incidents but has become a complex, multifaceted issue that varies […]
How to Apply Statistical Modeling to Engineering Data Streams at Scale
Reading Time: 4 minutesThe exponential growth of sensor networks, industrial automation, and interconnected systems has led to an unprecedented surge in engineering data streams. From smart grids and manufacturing lines to aerospace telemetry and autonomous vehicles, modern engineering systems generate continuous, high-volume, and high-velocity data. Effectively extracting value from these data streams requires advanced statistical modeling techniques capable […]
What IEEE-Style Engineering Conference Proceedings Reveal About Communications and Computer Systems Research
Reading Time: 7 minutesConference proceedings are often treated as a temporary layer of engineering culture: useful for tracking accepted papers, session titles, and the shifting language of a field, but rarely read as a structured signal in their own right. That is a mistake. In communications and computer systems research, proceedings often reveal something more valuable than the […]
Reliability Analysis of Intelligent Systems under Dynamic Conditions
Reading Time: 4 minutesAs intelligent systems become deeply integrated into engineering, industrial automation, and critical infrastructure, their reliability is no longer just a technical concern—it is a strategic necessity. From autonomous machines to AI-driven monitoring platforms, these systems operate in environments that are constantly changing. This makes reliability analysis of intelligent systems under dynamic conditions a crucial area […]
Performance Evaluation of Hybrid AI Models in Engineering Applications
Reading Time: 4 minutesEngineering applications are becoming increasingly complex, requiring advanced computational methods to process vast amounts of data and deliver accurate results. Traditional models, whether purely data-driven or physics-based, often struggle to balance accuracy, efficiency, and scalability. This challenge has led to the emergence of hybrid AI models, which combine multiple approaches to achieve superior performance. Performance […]
Detecting Conceptual Plagiarism Using Knowledge Graph Reasoning
Reading Time: 5 minutesPlagiarism in academic writing has evolved far beyond simple verbatim copying. Conceptual plagiarism, where ideas or arguments are borrowed without proper attribution, presents one of the most challenging problems for modern plagiarism detection systems. Unlike textual plagiarism, conceptual plagiarism may involve paraphrasing, restructuring, or entirely rewording ideas, making traditional string-matching approaches insufficient for detection. Recent […]
AI-Assisted Paraphrasing and Its Impact on Plagiarism Detection Systems
Reading Time: 4 minutesArtificial intelligence has rapidly transformed the landscape of academic writing. Among the most widely used technologies are AI-powered paraphrasing tools that allow users to rewrite existing content while preserving its original meaning. While these tools can support legitimate writing tasks such as editing and language improvement, they also introduce new challenges for plagiarism detection systems. […]
Benchmarking Plagiarism Detection Algorithms on Large Academic Datasets
Reading Time: 4 minutesBenchmarking plagiarism detection algorithms has become an essential research direction as digital academic publishing continues to expand. Universities, research institutions, and scholarly journals now manage enormous volumes of written material every year. With millions of research papers, theses, conference submissions, and technical reports being produced globally, ensuring originality has become a critical component of academic […]
Explainable Plagiarism Detection Systems: Interpretable AI for Editorial Decision-Making
Reading Time: 4 minutesDigital content has intensified the need for reliable plagiarism detection systems. Editors, reviewers, and academic institutions face an overwhelming volume of submissions daily, making the manual verification of originality nearly impossible. Traditional plagiarism detection tools, while effective at identifying text similarity, often operate as “black boxes,” providing scores and flags without clarifying the underlying reasoning. […]
Self-Supervised Learning Approaches for Detecting Disguised Academic Plagiarism
Reading Time: 4 minutesAcademic plagiarism has evolved far beyond simple copy-and-paste behavior. Today, disguised plagiarism—where original content is rephrased, translated, structurally modified, or algorithmically paraphrased—poses a significant challenge for journals, universities, and research institutions. Traditional string-matching systems struggle to detect semantic similarity when surface-level wording has been altered. As a result, modern detection strategies increasingly rely on machine […]
Multimodal Plagiarism Detection in Text, Source Code, and Presentation Files
Reading Time: 3 minutesСontent is no longer confined to plain text. Researchers, students, and developers often produce a mixture of textual documents, source code, and presentation materials. While this multimodal approach enriches communication and knowledge sharing, it also creates new challenges for plagiarism detection. Traditional plagiarism tools primarily focus on a single modality, such as text, leaving other […]
Semantic Embedding Techniques for Advanced Research Content Similarity Measurement
Reading Time: 4 minutesThe exponential growth of scientific publications and research outputs has created both opportunities and challenges in knowledge management. Researchers, institutions, and publishers increasingly need to assess the similarity of research content to ensure originality, detect potential plagiarism, and identify overlapping work. Traditional methods based on keyword matching, citation analysis, or n-gram comparison often fail to […]
Cross-Language Plagiarism Detection Using Multilingual Transformer Architectures
Reading Time: 4 minutesThe globalization of scientific communication has significantly increased the production and exchange of multilingual academic content. Researchers routinely translate articles, adapt conference papers for international journals, and publish findings in multiple linguistic contexts. While this expansion strengthens global collaboration, it also creates new vulnerabilities in research integrity. Cross-language plagiarism, where text is translated and reused […]
Blockchain for Academic Integrity: Ensuring Tamper-Proof Research Records
Reading Time: 4 minutesAcademic integrity is fundamental to the credibility and sustainability of scientific progress. As research outputs continue to expand across digital platforms, concerns related to data manipulation, falsification, plagiarism, and authorship disputes have intensified. Traditional record-keeping systems, often centralized and vulnerable to tampering, struggle to guarantee transparency and traceability. Blockchain technology, with its decentralized and immutable […]
AI-Powered Plagiarism Detection in Scientific Publications: Techniques and Challenges
Reading Time: 3 minutesMaintaining research integrity is essential for the credibility of scientific publications. With the growing volume of research output, traditional manual plagiarism detection methods are becoming insufficient. AI-powered plagiarism detection tools offer scalable, accurate, and intelligent solutions to identify content similarity, prevent misconduct, and ensure ethical scholarly practices. This article explores the techniques used in AI-driven […]
Measuring Research Integrity: Automated Content Similarity and Plagiarism Analysis
Reading Time: 3 minutesResearch integrity is a cornerstone of scientific progress, ensuring that published findings are accurate, original, and ethically conducted. With the exponential growth of scholarly content, traditional manual methods for detecting plagiarism and content duplication have become insufficient. Automated content similarity and plagiarism analysis tools have emerged as essential instruments for maintaining research integrity. This article […]
Statistical Modeling of Large-Scale Engineering Data Streams
Reading Time: 4 minutesThe increasing digitalization of engineering systems has fundamentally transformed the way operational data are generated, collected, and analyzed. Modern engineering infrastructures continuously produce massive volumes of data through distributed sensors, embedded control systems, and interconnected cyber-physical components. These large-scale data streams reflect the real-time dynamics of complex systems in domains such as industrial automation, energy […]