Re: RFC: ABI support for special memory area

To: "H.J. Lu" <hjl.tools@gmail.com>
Subject: Re: RFC: ABI support for special memory area
From: Suprateeka R Hegde <hegdesmailbox@gmail.com>
Date: Thu, 9 Mar 2017 20:53:54 +0530
Cc: Carlos O'Donell <carlos@redhat.com>, gnu-gabi@sourceware.org

H.J,

I think we are a full 180 degrees out of phase in our discussion this time somehow :-)

As I have already asked, I want to know what that ONE-FIXED-FORM of __gnu_mbind_setup, as called by ld.so, actually is.

The code you provided seems to be Intel's implementation of libmbind. I am interested in what it looks like in ld.so, because that is what we want to document in the ABI support. We do not want implementation-specific details in GNU-gABI.

So inside ld.so, would it be what I showed in my earlier mail, or would it be something else?
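
To make the question concrete, here is a minimal sketch of what a single fixed-form call site could look like on the loader side. Everything except the __gnu_mbind_setup prototype is an assumption made for illustration: struct mbind_region, loader_bind_regions, the stub body, and the stop-at-first-error policy are hypothetical and are not taken from ld.so or libmbind.

```c
#include <assert.h>
#include <stddef.h>
#include <stdio.h>

/* Hypothetical record of one special memory region found while loading.  */
struct mbind_region
{
  unsigned int type;   /* memory type, e.g. derived from sh_info */
  void *addr;          /* start of the region */
  size_t length;       /* size of the region in bytes */
};

/* Number of times the stub below has been called (illustration only).  */
static unsigned int stub_calls;

/* Stand-in for the run-time library entry point.  A real library would
   bind the memory; this stub only logs the request.  */
int
__gnu_mbind_setup (unsigned int type, void *addr, size_t length)
{
  printf ("bind type %u: from %p of size %zu\n", type, addr, length);
  stub_calls++;
  return 0;
}

/* Hypothetical loader-side loop: the same three arguments are passed for
   every region, whatever its memory type.  Returns 0 on success or the
   first non-zero value returned by __gnu_mbind_setup.  */
int
loader_bind_regions (const struct mbind_region *regions, size_t count)
{
  assert (regions != NULL || count == 0);

  for (size_t i = 0; i < count; i++)
    {
      int err = __gnu_mbind_setup (regions[i].type, regions[i].addr,
                                   regions[i].length);
      if (err != 0)
        return err;
    }
  return 0;
}
```

If this is the shape, then the three-argument prototype is the whole contract and any per-type extras have nowhere to go, which is exactly the point I would like clarified.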

In my opinion, we have to bring that out in the ABI support proposal. Without the actual signature/prototype, __gnu_mbind_setup sounds more like a guideline and less like an ABI spec/standard. And in actual code (in ld.so), it may eventually look quite different for each vendor/implementation.

So, either keep it as a guideline or make it generic. IMHO, we cannot keep the following (original text) as generic:

---

Run-time support

int __gnu_mbind_setup (unsigned int type, void *addr, size_t length);

---
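
For contrast, the pointer-to-implementation shape discussed further down in the quoted thread might look roughly like the sketch below. This is only an illustration under stated assumptions: mbind_setup_opaque, struct mbind_info, struct demo_hbw_info, DEMO_MBIND_HBW and demo_bind_hbw are names invented here, not part of any proposal or library.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>

/* Interface side: callers see only an incomplete type, so the per-type
   argument layout is not encoded in the prototype.  */
struct mbind_info;
int mbind_setup_opaque (unsigned int type, const struct mbind_info *info);

/* Library side: hypothetical type value and its argument layout.  */
#define DEMO_MBIND_HBW 1

struct demo_hbw_info
{
  void *addr;
  size_t length;
};

int
mbind_setup_opaque (unsigned int type, const struct mbind_info *info)
{
  if (info == NULL)
    return EINVAL;

  switch (type)
    {
    case DEMO_MBIND_HBW:
      {
        /* The type value tells the library how to interpret INFO.  */
        const struct demo_hbw_info *hbw
          = (const struct demo_hbw_info *) info;
        /* A real library would bind [addr, addr + length) here.  */
        return hbw->length != 0 ? 0 : EINVAL;
      }

    default:
      return EINVAL;
    }
}

/* Caller side: whoever builds the argument block must know the layout
   for the given type.  */
int
demo_bind_hbw (void *addr, size_t length)
{
  assert (addr != NULL);

  struct demo_hbw_info hbw = { addr, length };
  return mbind_setup_opaque (DEMO_MBIND_HBW,
                             (const struct mbind_info *) &hbw);
}
```

Note that this only moves the question: the prototype becomes generic, but the caller still has to agree with the library on each per-type layout, so that agreement would need to be documented somewhere.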

--
Supra


On 07-Mar-2017 04:05 AM, H.J. Lu wrote:

On Mon, Mar 6, 2017 at 5:25 AM, Suprateeka R Hegde
<hegdesmailbox@gmail.com> wrote:

On 04-Mar-2017 07:37 AM, Carlos O'Donell wrote:


On 03/03/2017 11:00 AM, H.J. Lu wrote:


__gnu_mbind_setup is called from ld.so.  Since there is only one ld.so,
it needs to know what to pass to __gnu_mbind_setup.  Not all arguments
have to be used by all implementations nor all memory types.



I think what Supra is suggesting is a pointer-to-implementation interface
which would allow ld.so to pass completely different arguments to the
library depending on what kind of memory is being defined by the sh_info
value. It avoids needing to encode all the types in the API, and just
uses an incomplete pointer to the type.



That's absolutely right.

However, I am not suggesting one is better than the other. I just want to
get clarity on what the code looks like for different implementations.

On 03-Mar-2017 09:30 PM, H.J. Lu wrote:


__gnu_mbind_setup is called from ld.so.  Since there is only one ld.so,
it needs to know what to pass to __gnu_mbind_setup.



So I want to know what is that ONE-FIXED-FORM of __gnu_mbind_setup being
called by ld.so.

 Not all arguments
have to be used by all implementations nor all memory types.



I think I am still not getting this. Really sorry for that. Would it be
possible for you to write a short piece of pseudocode that shows what this
design looks like for different implementations?


For my usage, I only want to know memory type, address and its size:

#define _GNU_SOURCE
#include <unistd.h>
#include <errno.h>
#include <stdint.h>
#include <cpuid.h>
#include <numa.h>
#include <numaif.h>
#include <mbind.h>

#ifdef LIBMBIND_DEBUG
#include <stdio.h>
#endif

/* High-Bandwidth Memory node mask.  */
static struct bitmask *hbw_node_mask;

/* Initialize High-Bandwidth Memory node mask.  This must be called before
   __gnu_mbind_setup.  */
static void
__attribute__ ((used, constructor))
init_node_mask (void)
{
  if (__get_cpuid_max (0, 0) == 0)
    return;

  /* Check if vendor is Intel.  */
  uint32_t eax, ebx, ecx, edx;
  __cpuid (0, eax, ebx, ecx, edx);
  if (!(ebx == 0x756e6547 && ecx == 0x6c65746e && edx == 0x49656e69))
    return;

  /* Get family and model.  */
  uint32_t model;
  uint32_t family;
  __cpuid (1, eax, ebx, ecx, edx);
  family = (eax >> 8) & 0x0f;
  if (family != 0x6)
    return;
  model = (eax >> 4) & 0x0f;
  model += (eax >> 12) & 0xf0;

  /* Check for KNL and KNM.  */
  switch (model)
    {
    default:
      return;

    case 0x57: /* Knights Landing.  */
    case 0x85: /* Knights Mill.  */
      break;
    }

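  /* Check if NUMA is available; libnuma requires numa_available to be
     called before any other libnuma function.  */
  if (numa_available () < 0)
    return;

  /* Check if NUMA configuration is supported.  */
  int nodes_num = numa_num_configured_nodes ();
  if (nodes_num < 2)
    return;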

  /* Get MCDRAM NUMA nodes.  */
  struct bitmask *node_mask = numa_allocate_nodemask ();
  struct bitmask *node_cpu = numa_allocate_cpumask ();

  int i;
  for (i = 0; i < nodes_num; i++)
    {
      numa_node_to_cpus (i, node_cpu);
      /* NUMA node without CPU is MCDRAM node.  */
      if (numa_bitmask_weight (node_cpu) == 0)
        numa_bitmask_setbit (node_mask, i);
    }

  if (numa_bitmask_weight (node_mask) != 0)
    {
      /* On Knights Landing and Knights Mill, MCDRAM is High-Bandwidth
         Memory.  */
      hbw_node_mask = node_mask;
    }
  else
    numa_bitmask_free (node_mask);
  numa_bitmask_free (node_cpu);
}

/* Support all different memory types.  */

static int
mbind_setup (unsigned int type, void *addr, size_t length,
    unsigned int mode, unsigned int flags)
{
  int err = ENXIO;

  switch (type)
    {
    default:
#ifdef LIBMBIND_DEBUG
      /* type is unsigned and length is a size_t, so use %u and %zu.  */
      printf ("Unsupported mbind type %u: from %p of size %zu\n",
              type, addr, length);
#endif
      return EINVAL;

    case GNU_MBIND_HBW:
      if (hbw_node_mask)
        err = mbind (addr, length, mode, hbw_node_mask->maskp,
                     hbw_node_mask->size, flags);
      break;
    }

  if (err < 0)
    err = errno;

#ifdef LIBMBIND_DEBUG
  printf ("Mbind type %u: from %p of size %zu\n", type, addr, length);
#endif

  return err;
}

int
__gnu_mbind_setup (unsigned int type, void *addr, size_t length)
{
  return mbind_setup (type, addr, length, MPOL_BIND, MPOL_MF_MOVE);
}

If other memory types need additional information, they can be
passed to __gnu_mbind_setup.  We just need to know what
information is needed.
