Academic

Generating Realistic, Protocol-Compliant Maritime Radio Dialogues using Self-Instruct and Low-Rank Adaptation

arXiv:2603.04423v1 Announce Type: new Abstract: VHF radio miscommunication remains a major safety risk in maritime operations, with human factors accounting for over 58% of recorded incidents in Europe between 2014 and 2023. Despite decades of operational use, VHF radio communications are still prone to noise, interference, linguistic variability, and the absence of real-time transcription, making procedural errors both frequent and difficult to correct. Developing AI-assisted systems to support real-time communication and decision-making requires a considerable amount of high-quality maritime data, yet operational, regulatory, and privacy constraints render such datasets scarce. This study introduces a compliance aware Self-Instruct methodology for generating realistic maritime radio dialogues that conform to the IMO's SMCP. Our approach integrates a 26-filter verification pipeline directly into the iterative generation loop to enforce entity information accuracy, hallucination detec

G
G\"ursel Akdeniz, Emin Cagatay Nakilcioglu
· · 1 min read · 2 views

arXiv:2603.04423v1 Announce Type: new Abstract: VHF radio miscommunication remains a major safety risk in maritime operations, with human factors accounting for over 58% of recorded incidents in Europe between 2014 and 2023. Despite decades of operational use, VHF radio communications are still prone to noise, interference, linguistic variability, and the absence of real-time transcription, making procedural errors both frequent and difficult to correct. Developing AI-assisted systems to support real-time communication and decision-making requires a considerable amount of high-quality maritime data, yet operational, regulatory, and privacy constraints render such datasets scarce. This study introduces a compliance aware Self-Instruct methodology for generating realistic maritime radio dialogues that conform to the IMO's SMCP. Our approach integrates a 26-filter verification pipeline directly into the iterative generation loop to enforce entity information accuracy, hallucination detection, SMCP-compliance, logical consistency, and linguistic diversity. We employ LORA for parameter-efficient fine-tuning, reducing computational overhead during training and enabling efficient deployment of the resulting models on resource-constrained maritime systems. To assess dataset quality, we introduce a novel evaluation framework combining automated and expert assessments: Format Accuracy, Information Accuracy, Uniqueness, and Logical Coherence. Experiments using publicly available vessel, coastal and AIS datasets demonstrate that the approach produces synthetically diverse, procedurally compliant, and operationally realistic dialogues. Although downstream applications such as automatic speech recognition and natural language processing are reserved for future work, the released code, datasets, and verification tools provide a reproducible foundation for artificial intelligence-assisted maritime safety and other safety-critical domains.

Executive Summary

This article presents a novel AI-driven methodology to generate realistic, protocol-compliant maritime radio dialogues using Self-Instruct and Low-Rank Adaptation (LORA). Addressing a critical safety concern—VHF radio miscommunication—the study leverages compliance-aware generation with a 26-filter verification pipeline to mitigate hallucination, ensure SMCP adherence, and enhance linguistic diversity. The use of LORA for efficient fine-tuning is particularly commendable for scalability in resource-constrained maritime environments. The evaluation framework combining automated and expert assessments adds credibility to the synthetic data quality. Overall, the work provides a reproducible, technically sound foundation for AI-assisted maritime safety.

Key Points

  • Integration of compliance-aware Self-Instruct with LORA for efficient training

Merits

Strength in Compliance Enforcement

The 26-filter verification pipeline directly embedded in the generation loop is a significant innovation, offering a structured mechanism to enforce SMCP compliance, entity accuracy, and logical consistency—critical for safety-critical applications.

Efficient Deployment via LORA

Utilizing LORA for parameter-efficient fine-tuning demonstrates pragmatic engineering: reducing computational costs while enabling scalable deployment on maritime systems with limited resources.

Demerits

Evaluation Framework Limitation

While the evaluation framework (Format Accuracy, Information Accuracy, Uniqueness, Logical Coherence) is robust, its reliance on expert assessment introduces subjectivity and potential scalability issues in large-scale deployment or cross-regional validation.

Scope Constraint

The study explicitly states downstream applications (e.g., ASR, NLP) are reserved for future work, limiting immediate applicability and creating a gap between generation and operational integration.

Expert Commentary

From a legal and academic perspective, this study occupies a pivotal intersection between AI ethics, maritime safety, and regulatory compliance. The integration of compliance constraints into generative AI models—specifically within the maritime domain—is a novel and necessary evolution beyond generalist LLMs. The authors recognize the critical legal implications: miscommunication on VHF radio is not merely a technical issue but a liability and regulatory concern. By embedding compliance verification as an iterative, automated checkpoint, they effectively preempt potential legal risks arising from AI-generated content that could mislead operators or compromise safety. Moreover, the open-source release of code, datasets, and verification tools signals a commitment to transparency and reproducibility—key pillars in legal acceptance of AI tools in safety-critical contexts. This work sets a benchmark for domain-specific AI generation, demonstrating that ethical, legal, and technical considerations can be harmonized without compromising innovation. Future research should extend this framework to cross-linguistic diversity in non-English maritime contexts and explore liability allocation between human operators and AI assistants.

Recommendations

  • Extend the verification pipeline to include cross-cultural linguistic validation for international maritime operations.
  • Develop a legal white paper or regulatory impact assessment to accompany the open-source release, facilitating smoother adoption by maritime authorities and legal stakeholders.

Sources