-
Notifications
You must be signed in to change notification settings - Fork 189
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Should db.system be db.system.name? #1581
Comments
I have an arbitrary (and optional) rule of thumb for attribute names to follow The |
Agree that adding |
I can open a PR. Can a maintainer assign it to me? |
@lmolkova and I added this to Monday's semconv meeting to get more feedback since it would affect several semantic conventions
|
Discussed (very) briefly with feature flag sig and nobody had an objection |
I think this change might be a good way to future proof against entities in the future. Assuming we will want to use the same dictionary of attributes for entities, we might have things like system version, region, etc for entities in the future. |
an interesting exception to this rule appears to be |
@jack-berg this db change is being conflated with genai in this PR, was that your intent? #1613 |
the main problem conflating as I tried to elude to in comments is that genai is more like rest api in practice. So, provider like cloud provider, and usually the cloud provider isn't versioned. It is one or more api groups within it that are versioned, if that happens. I feel a need to be on the defense that some decision made about databases will be projected out to genai, even if it wouldn't be projected out to normal cloud APIs which usually genai systems are implemented as. I wish semantically we could decouple these things, so that we can focus more on progress than defense or accident of coupling. an LLM is not a database |
We're trying to stay consistent across domains. One way to achieve it is to follow the same naming patterns. The naming pattern We should fix it across the repo and it has nothing to do with how similar or different the technologies are. We can pick different names (e.g. |
When I opened this issue I was making an observation about a single domain, but I think extracting out common modeling rules adds rails which improve our productivity and consistency. Of course, the trick is deciding where it makes sense to have common modeling rules and when a rule is / isn't applicable to a particular situation. |
unsolicited 2p again: concrete examples especially in each domain are useful not just in defining, but also when breaking attributes. Otherwise, the cost to break isn't understood and we contribute to a general sense of perhaps arbitrary instability. If we can't prove 3 things that require breaking out a name into a struct, probably best to think twice. This applies to all domains, so each one who uses provider has a different context for that. So, I mean real values not invented ones in one domain to support of a proposal of a change in a different domain. End users would be best to suggest these problems. In summary, it is easy in abstract to justify a model change when not put to use, but it is better to use real problems reported by users to motivate cross-cutting changes. End of unsolicited advice, hope some of it is considered while rolling out this change to several domains. |
I think there are competing concerns. Change for change's sake has bad optics and is hard on users. But given that semantic-conventions is fundamentally a taxonomy project, there is some real value in having conceptual purity. Things like symmetry across domains and following rules of thumb to ensure that we don't get boxed-in in the future. I suspect that in the long term too much slop would compound into a taxonomy that people don't like and want to replace. To me, its classic short term vs. long term trade off. Tough to balance. Ideally, as we get more experience, we extract out the common rules / guidelines (i.e. extending what we already have in places like naming considerations) so that we have a higher likelihood of getting it right the first time (good for users) while also having consistency and maintaining ability to evolve. |
conceptual purity != naming convention purity across different concepts. unrelated buckets to satisfy naming convention purity reduces conceptual purity. It doesn't serve one domain to force to be in unrelated and conflated buckets due to another domain. It is a non-goal imho to trade concepts for naming conventions of unrelated tech. Let's focus on this when trying to optimize, as really what's good for users is things being coherent and also not drifty. |
The original We should expect more of these (e.g. Having patterns and common attributes/guidance across conventions is important and we're going to push for consistency in experimental conventions. |
Yes, we can break db etc because .name or other things we feel are important, but some person in TC should ask themselves "should we" or "when should we" especially as the blast radius of work if far outside this org. If the TC decides to not consider the blast of work as a part of what's important in conventions, that's a hazard for anyone participating. I'm speaking into the void as I think most people impacted by this personally are not subscribed to this issue. However, a simple string search on github should show that renaming attributes widely used has impact on people not on this issue to see it. |
you should ask @open-telemetry/specs-semconv-maintainers who are responsible for this repo. |
To both @jack-berg and @lmolkova's points here - We are making a long term vs. short term trade-off. Semantic Conventions is a long-term focused projects. The value of conventions is that they are stable + consistent. Specifically they need to be stable for a long period of time, and consistent across a large body of domains and vendors. For this specific issue, @lmolkova and the HTTP/DB semconv folks have been carving that path for us recently, including the "meta" conventions we use. Additional good reading comes from the Systems semconv group, that finally outlined some principles around our ideas of "t-shaped" conventions: #1618
That is entirely the point of this issue - answering that question. This is something we ask ourselves and debate a LOT within semconv. It's ultimately some of the hardest decision making within OpenTelemetry (that I've been involved with) and one we don't take lightly. But please, don't mistake disagreement for lack of care or thought. We understand, and have been reinforced in that understanding, the rippling implications of change for OpenTelemetry on changing semantic conventions. However, you should envision the final state of Semconv more as a 2.0 of instrumentation for OpenTelemetry, as evidenced in the work the Java Agent took for HTTP semconv. We do not want to leave folks relying on de-facto stable semantic conventions in the dust. However, we do need to change and evolve what we have. It's been where a majority of the 'core' effort in semconv has been placed. Now directly to this issue. DB and LLM semantic conventions are not stable and in fact, are rapidly changing. I think there are two needs here:
Semantic conventions focuses on the second. The first SHOULD be provided before coming to semantic conventions. Semantic Conventions doesn't aim to immediately replace these, but slowly provide a stable alternative to where folks are today.
I'm not sure I appreciate the combative nature about "defense against decisions" here. This is an open community building consensus towards conventions across that community. Of course decisions around those general conventions will impact everyone, this is why we have a "general" semconv meeting and discuss/raise issues there. Even if unable to attend these meetings, the key PRs are called out in notes so folks can follow along and comment. All of these general principles we define, we're working towards PRs to encode them, e.g. #1707. You are more than welcome to participate in these discussions, PRs and offer thoughts. |
Area(s)
area:db
Is your change request related to a problem? Please describe.
As we approach stability, I wanted to throw out a question of whether we should rename
db.system
todb.system.name
in order to preserve the ability to usedb.system
as a namespace.I don't see any need to include other
db.system.*
attributes on metrics / spans. But supposing semantic conventions expand to database servers (e.g. resource conventions to represent a database server), its possible we might want to include other attributes along the lines ofdb.system.version
.Describe the solution you'd like
Consider renaming
db.system
todb.system.name
.Describe alternatives you've considered
No response
Additional context
No response
The text was updated successfully, but these errors were encountered: