CARVIEW |
Select Language
HTTP/2 200
date: Wed, 08 Oct 2025 14:34:47 GMT
content-type: text/plain
content-length: 20815
cf-ray: 98b65c6e9f49c1ae-BLR
content-location: 23-mlw-lt-irc.txt
vary: negotiate,accept,accept-charset,Accept-Encoding,Origin
tcn: choice
last-modified: Wed, 23 Jan 2013 17:21:41 GMT
etag: "1093a-4d3f7ed739340;602037f93ee54
cache-control: max-age=21600
expires: Wed, 08 Oct 2025 20:34:47 GMT
content-encoding: gzip
access-control-allow-origin: *
x-backend: www-mirrors
x-request-id: 98b65c6e9f49c1ae
strict-transport-security: max-age=15552000; includeSubdomains; preload
content-security-policy: frame-ancestors 'self' https://cms.w3.org/ https://cms-dev.w3.org/; upgrade-insecure-requests
cf-cache-status: BYPASS
accept-ranges: bytes
set-cookie: __cf_bm=19KyMpHfogjvOE9myJDo6FbO1jy5kOav89ZfannAjyM-1759934087-1.0.1.1-jkUFQRvfJ6nS4HAyij_CvaOI3ie62ZBZY4d2f9EEn6qPRxqefEIxrhGIFlgoUARxf6k43nUQAfDiGTIiPuo3HLIqiRIkPlyGSqE0ZuqHWPU; path=/; expires=Wed, 08-Oct-25 15:04:47 GMT; domain=.w3.org; HttpOnly; Secure; SameSite=None
server: cloudflare
alt-svc: h3=":443"; ma=86400
07:34:02 RRSAgent has joined #mlw-lt
07:34:02 logging to https://www.w3.org/2013/01/23-mlw-lt-irc
07:34:09 meeting: MLW-LT f2f
07:34:11 chair: felix
07:34:20 agenda: https://www.w3.org/International/multilingualweb/lt/wiki/PragueJan2013f2f#Agenda
07:34:37 topic: role call
07:34:45 checking attendance ...
07:34:49 present+ fsasaki
07:34:59 Zakim has joined #mlw-lt
07:35:12 I have made the request to generate https://www.w3.org/2013/01/23-mlw-lt-minutes.html fsasaki
07:36:12 s/role/roll/
07:36:13 I have made the request to generate https://www.w3.org/2013/01/23-mlw-lt-minutes.html fsasaki
07:57:55 tadej has joined #mlw-lt
07:59:58 fsasaki has joined #mlw-lt
08:00:40 Yves_ has joined #mlw-lt
08:00:47 present+ Yves
08:00:57 daveL has joined #mlw-lt
08:01:07 Marcis has joined #mlw-lt
08:01:14 present+ Marcis
08:01:23 Ankit has joined #mlw-lt
08:01:23 leroy has joined #mlw-lt
08:01:29 Arle has joined #mlw-lt
08:01:31 present+ leroy
08:01:35 present+ Ankit
08:01:41 present+ Arle
08:01:43 I have made the request to generate https://www.w3.org/2013/01/23-mlw-lt-minutes.html fsasaki
08:06:37 topic: https://tinyurl.com/its20-comments-handling
08:06:41 truedesheim has joined #mlw-lt
08:06:48 present+ dave
08:07:17 Jirka has joined #mlw-lt
08:08:01 https://www.w3.org/International/multilingualweb/lt/wiki/PragueJan2013f2f#Agenda
08:09:21 pnietoca has joined #mlw-lt
08:09:21 mdelolmo has joined #mlw-lt
08:09:30 present+ pnietoca
08:09:31 scribe daveL
08:09:37 present+ mdelolmo
08:09:39 topic: issue-67
08:10:09 yves: had no feedback from shaun to date so we probabl can't advance here
08:10:37 related: https://www.w3.org/International/multilingualweb/lt/track/issues/105
08:10:48 kfritsche has joined #mlw-lt
08:11:03 present+ Karl
08:12:00 felix: comment could be addressed by dropping the ref to XML schema
08:13:05 yves: will respond on issue 105
08:13:14 topic: issue-69
08:13:23 related: https://www.w3.org/International/multilingualweb/lt/track/issues/69
08:14:20 External rules may also have links to other external rules (see example 20). The linking mechanism is recursive, and subsequently after the processing the rules MUST be read top-down (see example 21).
08:14:24 I have made the request to generate https://www.w3.org/2013/01/23-mlw-lt-minutes.html fsasaki
08:14:50 https://www.w3.org/TR/2012/WD-its20-20121206/#link-external-rules
08:14:58 pablo: had responded that this was clear in the specification, but suggest a clarification
08:15:19 the section is 5.4. (last paragraph)
08:15:23 felix: confirms this is just a clarification
08:15:26 change it
08:15:47 "The linking mechanism is recursive" > "The linking mechanism is recursive in a depth-first approach"
08:16:01 tadej: perhaps explain this recursion as being 'depth first' to be understandable more by computer scientists
08:17:04 topic: issue-70
08:17:20 related: https://www.w3.org/International/multilingualweb/lt/track/issues/70
08:18:09 felix: ref to section 5.5
08:18:14 https://www.w3.org/TR/2012/WD-its20-20121206/#selection-precedence
08:20:31 will add one entry between "global selections" and "data category defaults" for inherited information, but not specific to local markup
08:20:45 I have made the request to generate https://www.w3.org/2013/01/23-mlw-lt-minutes.html fsasaki
08:20:56 kfritsche has joined #mlw-lt
08:21:00 topic: issue-71
08:21:04 related: https://www.w3.org/International/multilingualweb/lt/track/issues/71
08:21:21 annotatorsRef
08:21:24 scribe: fsasaki
08:22:09 daveL: Yves said the problem is: you can have a lot of annotatorRefs
08:22:28 .. issue is: how to deal with annotatorRefs with two instances of local standoff markup
08:22:37 .. e.g. lq localization issues and provenacne records
08:22:54 .. so you can have multiple records of the same data category applying to the same selection
08:23:09 .. you don't get the information whether the information comes from different processes
08:23:36 .. Yves suggested whether we can put the information into the same ...
08:23:49 .. my view was: for provenacne annotator ref is not that important
08:24:19 .. so in the mail last night: could we exlude the lqi and provenance from annotatorsRef
08:24:34 .. annotatorRefs is telling you what provided the provenacne annotation
08:24:55 tadej: from provenance it is not needed, but for lqi?
08:28:08 dave: don't think so for lqissue.
08:28:35 yves: sounds weird: have annotatorsRef mandatory for some data cats, possible for others, forbidden for two ...
08:28:58 .. currently it is required for mt-confidence and disambiguation
08:29:21 ... and Terminology
08:29:26 yves: otehr solution: you could have it mandatory for these two data categories, and don't have it for others
08:29:36 .. that would make things a lot simpler
08:30:03 dave: agree - not having two features interacting (standoff and annotatorsRef) would be good
08:31:32 Milan has joined #mlw-lt
08:33:47 felix potential resolution - so keep it mandatory for mt-confidence, disambiguation and term, and edit the list of data category items in the spec
08:34:49 scribe: daveL
08:36:47 action: dLewis6 to come back to chase and kevin about discussion of issue-71 https://www.w3.org/2013/01/23-mlw-lt-irc#T08-34-49
08:36:48 Created ACTION-388 - Come back to chase and kevin about discussion of issue-71 https://www.w3.org/2013/01/23-mlw-lt-irc#T08-34-49 [on David Lewis - due 2013-01-30].
08:38:29 action: felix to change example https://www.w3.org/TR/2012/WD-its20-20121206/#EX-its-tool-annotation-1 if the agree on issue-71 , see discussion at https://www.w3.org/2013/01/23-mlw-lt-irc#T08-34-49
08:38:29 Created ACTION-389 - Change example https://www.w3.org/TR/2012/WD-its20-20121206/#EX-its-tool-annotation-1 if the agree on issue-71 , see discussion at https://www.w3.org/2013/01/23-mlw-lt-irc#T08-34-49 [on Felix Sasaki - due 2013-01-30].
08:39:32 swalter has joined #mlw-lt
08:40:00 present+ swalter
08:41:02 felix: example 28 needs to be revised also, will do this now
08:45:39 dF has joined #mlw-lt
08:46:33 present+ dF
08:49:37 scribe: fsasaki
08:49:56 daveL: using the example in the test file - should we have usage of the data categories in the elements?
08:49:58 yves: yes
08:51:35 dave: this example doesn't actually include the data category attributes to which the annotatorRef refers
08:52:00 felix: makes note that the test file and the example should be revised to include this
08:52:28 yves: we don't have annotatorsRef for all disambiguation examples
08:52:46 yves: we don't have annotatorRef in all examples of disambiguation
08:53:15 action: tadej to check disambiguation examples with regards to presence of annotatorsRef
08:53:15 Created ACTION-390 - Check disambiguation examples with regards to presence of annotatorsRef [on Tadej Štajner - due 2013-01-30].
08:55:13 topic: ISSUE-72 NIF comment
08:55:32 felix: comment was which version of NIF do we refer to
08:55:40 https://lists.w3.org/Archives/Public/public-multilingualweb-lt-comments/2013Jan/0015.html
08:55:55 .. there are 1.0 and 2.0
08:56:08 .. also there stabilit was raises
08:57:21 ... and Christian also raised whether the mapping was canonical
08:58:19 dF: it may be a useful clarification for implementators
08:58:46 felix: but its not clear what is meant by 'canonical XML' in this case
08:59:34 tadej: it implied there should be a canonical XML serialisation
09:00:56 felix: would such a requirement raise a bar for implementors, this need to be dicussed further on the lists
09:01:48 felix: now will attempt to dial in Christian
09:04:36 rrsagent, draft minutes
09:04:36 I have made the request to generate https://www.w3.org/2013/01/23-mlw-lt-minutes.html Yves_
09:08:11 topic: issue-68
09:08:17 scribe: fsasaki
09:08:54 marcis: there was a discussion on ITS term and disambiguation
09:09:05 chriLi has joined #mlw-lt
09:09:08 .. christian brought it up, various comments from the WG
09:09:34 .. david suggested that we should not break ITS1.0, but felix said it is not necessary to have it
09:09:46 marcis: summarises discussion
09:10:06 https://lists.w3.org/Archives/Public/public-multilingualweb-lt-comments/2013Jan/0152.html
09:10:15 daveF: don't break it if it works
09:10:22 .. that's the bottom line
09:10:41 .. we want to keep also independence of features
09:10:58 marcis: I could implement terminology independent of the rest of disambiugation
09:11:13 .. the question is: if we agree to change something, it is independent, so different question
09:11:30 .. david suggested to have a bp document that specifies how things relate
09:11:44 daveF: there are seperate use cases for disambiguation and terminology
09:12:02 .. things are backed by different use cases, also from the implementers point of view
09:12:56 felix: we can also depcreate one of these
09:13:09 tadej: if we want to annotate the same fragment - which one to choose?
09:13:14 marcis: that is the biggest problem
09:13:21 .. we cannot do both
09:13:54 marcis: there was a comment from yves, we should break larger problems into smaller ones
09:14:20 .. so even if we have an "upper level" data category which we could then use for both scenarios
09:14:51 tadej: we could use the same trip we did with annotators ref, e.g. using multiple values in the same attribute
09:15:03 .. not sure if we would encourage people to do this
09:15:16 .. complex, but same level of complexity as ...
09:15:57 tadej: another solution tadej suggested was to have many attributes , but that's the same as having everything in one attribute
09:16:10 .. if we can come up with a closed set of types of annotation, that's a solution
09:16:23 .. but that needs to be a closed set, since we are specifying attributes
09:16:48 .. right now for disambiguation we agreed for three levels: concept, entity, lexicon
09:17:10 marcis: there is no definition for each of these levels, e.g. what is a lexical concept?
09:17:20 .. I saw that there is a terminology inconcistency
09:17:34 .. terminology is not used always in the same way in the disambiguation description
09:18:07 daveL: the issue in using both of them for the same term - we are not clear how to combine them?
09:18:16 tadej: it is not an issue at the moment
09:18:34 .. if you fold it in one data category, it becomes a problem
09:18:51 queue
09:19:13 q+
09:19:45 daveF: a big system will have a terminology life cycle with many manual people, but it is an automatic workflow
09:20:09 daveL: aim of disambiguation is that it would make the output of automatic annotation available
09:20:21 ack c
09:20:51 christian: thanks to marcis for putting everything into a condensed form
09:21:08 .. there are we with the discussion today: my understanding is the following:
09:21:45 .. people think it is not a bad idea to try to come up with a data category that can subsume what ITS2 terminology and ITS2 disambiguation try to cover
09:22:05 .. with respect to paying attention to ITS1: situation is that there is no need to go for backwards compatibility
09:22:17 .. one way to achive soft transition would be to deprecate existing ITS term
09:22:32 s/achive/achieve/
09:23:00 .. one way to come up with the upper level data category: two implementation suggestions were made: based on attrbiute values and distinct values for annotation types
09:23:01 q+
09:23:18 .. this is how I understand the current state of the discussion
09:23:31 .. I'm wondering what the next step would be
09:23:46 .. to say: we realize that we want to really look into this change
09:23:55 .. and want to do something to the current draft
09:24:15 .. if this wants to be driven it could be done via mail or a seperate call
09:24:27 .. need to agree on the approach
09:24:35 ack fsa
09:24:56 scrie: daveL
09:25:12 scribe: daveL
09:25:40 felix: we have agreement that backward compatability isn't an absolute barrier
09:25:54 ... but it is in my view desirable
09:26:01 Christian: fully agree
09:26:37 felix: another point is trying in general to reduce level of substantive change
09:26:57 felix: another point is experience of people who implement and knwo users of its1.0 terminology
09:27:11 ... such as yves and OKAPI community
09:27:49 yves: not necessarily a big problem to change but would like to keep backward compatibility in general
09:28:08 rrsagent, draft minutes
09:28:08 I have made the request to generate https://www.w3.org/2013/01/23-mlw-lt-minutes.html Yves_
09:28:12 tadej: suggested changes would break backward compatibility
09:29:14 macis: potetnially we add complexity to terminology by including link to external ontology or other lexical resource
09:29:23 df: agrees
09:30:46 felix: compromise is having an umbrella data category, and allow term to stay the same
09:31:02 q+
09:31:18 ack ch
09:31:23 arle: agre with marcis
09:31:28 s/agre/agree/
09:31:37 marcis: have some questionns about the definition of disambiguation, e.g. the meaning of what is a lexical concept
09:32:23 christian: support having an umbrella data category that would not increase complexity of seaprate term and disambiguation use case
09:33:24 I have made the request to generate https://www.w3.org/2013/01/23-mlw-lt-minutes.html fsasaki
09:33:25 ... also we will get better uptake if we can offer an easier route to marking up the output of text analysis
09:33:51 ... rather than having to support the more complex issues in disambiguation
09:34:38 present+ christan(for10-11-call)
09:34:40 I have made the request to generate https://www.w3.org/2013/01/23-mlw-lt-minutes.html fsasaki
09:35:23 tadej: the reason for defining granularities was the major requirements of linguists, it was not sufficient to have this all in the target external data structure
09:35:47 ... so even granularity definition was a compromise
09:36:03 arle: the term 'granularity' may also be an issue
09:36:43 tadej: was previously 'disambiguation type', but it was difficult to find the right term
09:37:39 felxi: asks tadej, marcis, christan to come up with a proposal that allows for both use cases and consider backward comatibility for term?
09:37:57 ... but this would need to be done by the end of next week?
09:39:07 Without putting too much thought into it, would disambiguationClassType work? Would this always correspond to a description of the kind of disambiguationClass intended?
09:39:49 christian: happy to let marcis and tadej to try and draft something over these two days and then I can dial in again to discuss it further
09:40:04 marcis: asks who was originator of disambig
09:41:38 tadej: originally it was a named entity recoginiser category, but after discussion also became merged with diasambiguation afteter discussion with linguasev and others
09:42:29 marcis: could we have a cascading model, since named entity can be composite
09:43:56 Don't forget to bring the beer bottles to the room as well :-)
09:44:02 daveL: note this overlaps with issue-109 on disambiguation in indic languages
09:44:22 topic: issue-75
09:44:34 https://lists.w3.org/Archives/Public/public-multilingualweb-lt-comments/2013Jan/0143.html
09:45:02 felix: jorge as shepard has produced a summary of this topic
09:45:25 christian: my domain comment had three parts
09:46:20 .. one main point - was looking for a way for providing to meta-data on a domain without pointing to resource, this has no eyyt been resolved
09:46:41 ... another point was that domain meta-data is processor specific
09:47:24 ... so in one world it is called x then the context in which x is meaningful needs to be provided
09:47:57 https://lists.w3.org/Archives/Public/public-multilingualweb-lt-comments/2013Jan/0136.html
09:48:08 ... now jorge has resolved point 2b, but the baove has still also to be resolved
09:49:12 felix: felt adding this context meta was a new feature but could be reolved with a note that this relates to a single engine use case
09:50:09 https://www.w3.org/International/multilingualweb/lt/track/products/9
09:50:13 christian; broadly agrees such a note would satisfy him, since in ITS the focus was on scenarios with a single engine scenario. But this need to be made clear as an assumption in ITS2.0
09:50:44 felix: have now started collacting items on tracker categories as 'not addressed in ITS2.0'
09:51:54 topic: issue-73
09:51:55 ... so if larger implementors, e.g. sap, adobe, ms, will but resoruces into the multiengine scenario we could consider it, other we should stick with making explicit the single engine context
09:52:23 https://lists.w3.org/Archives/Public/public-multilingualweb-lt-comments/2013Jan/0015.html
09:53:03 felix: with NIF the stability is an issue and will refer back to sebastian Helleman about the plan for this
09:53:17 ... need this information to react fully to this comment
09:54:00 .. other comment was how the mapping could benefit from canonical definiition of mapping
09:54:19 q+
09:54:35 ack chr
09:54:53 Felix: so my comment is whether this would be of use to implementors, since in the room there was a lot of familiarisation with the use and benefits of canonicalisation
09:55:19 christian: asks do we have more than one implementation
09:55:32 felix: confirms we have one from sebastian and one from felix
09:56:41 christian: I brought this up to ensure that whenever NIF processing is ensured, we end up with the same representation, and this needs normalisation and canonicalisation
09:57:21 ... if not, then we may end up with versions that are incompatible
09:58:29 felix: asks whether some comparison between document in NIF is an likely use case. would the comparison not takeplace back in the document itself
09:59:25 chrsitian: I think you would need a unicode normalisation
09:59:41 felix: but this was related to regex in another data category
09:59:58 s/chrsitian/christian/
10:00:16 christian: if we are reocmmending normalisation anyway in this other data category, could we not use this to solve the problem here
10:00:34 topic: issue-74
10:01:05 q+
10:01:39 scribe: fsasaki
10:01:48 daveL: christian provided some bullet point comments
10:01:55 .. are you planning more re-writing
10:02:04 .. or should david and I take your comments in?
10:02:06 ack cl
10:02:10