source: git/lib/fixmsgs.pl @ ebf46a3

RELEASE/1.0RELEASE/1.1RELEASE/1.2debug-cidebug-ci-sanitisersstereowalls-data
Last change on this file since ebf46a3 was a39f74f, checked in by Olly Betts <olly@…>, 23 years ago

Script to tidy up accented characters, etc in Survex message file.

git-svn-id: file:///home/survex-svn/survex/trunk@1038 4b37db11-9a0c-4f06-9ece-9ab7cdaee568

  • Property mode set to 100755
File size: 499 bytes
Line 
1#!/usr/bin/perl -w
2require 5.004;
3use strict;
4
5use integer;
6
7my @ent = ();
8for (128..255) {
9   $ent[$_] = "#$_";
10}
11
12open ENT, "named-entities.txt" or die $!;
13while (<ENT>) {
14   my ($e, $v) = /^(\w+),(\d+)/;
15   $ent[$v] = $e;
16}
17close ENT;
18
19while (<>) {
20   if (/^\s*#/) {
21      print;
22      next;
23   }
24   
25   my ($pre, $msg) = /^([-\w,]+:\s*\d+\s+)(.*)/;
26
27   $msg =~ s/([\x80-\xff])/"&".$ent[ord($1)].";"/ge;
28
29   $msg =~ s/'\%s'/`%s'/g;
30   $msg =~ s/^([^`']* )\%s'/$1`%s'/g;
31
32   print "$pre$msg\n";
33}
Note: See TracBrowser for help on using the repository browser.