Kotlin Native编译原理03 - 简单「深入」理解Objective-C运行时（二）

其实上一篇文章《Kotlin Native编译原理02 - 简单「深入」理解Objective-C运行时（一）》写完后，这篇文章就立马开始写了。但是在写文章的那段时间，有很多活动，所以写文章的事情也渐渐耽搁了下来，直到最近。

上一篇文章写完后，我就在思考，写的文章是不是跑题了？明明我要讲的是“运行时”，为什么会牵涉到很多内核甚至指令集的知识？但是事实就是这么有意思，“运行时”确实是底层原理息息相关。

OOP与消息#

世间本无OOP，OOP的概念是如何发明的呢？

1950-1960年代，大家还在用LISP语言开发时，开始对某块连续的内存（结构体）描述成对象。1960年代，Simula语言诞生，首次引入了对象、类、继承、虚过程（早期的vtable）的概念，是世界上第一门面向对象的语言。Simula首次将代码跟子过程绑定在一起，后期这一概念称为“方法”，即某个类对一个函数的实现。类里的每个方法都会被编译为单独的函数，每个函数的第一个参数固定为对象地址。这样调用某个对象的方法，其实是调用方法对应的函数，第一个参数传入该对象的地址。

Alan Kay受Simula的启发，结合自己对OOP的理解，于1970年发布第一门纯面向对象语言Smalltalk。Alan Kay对OOP的理解和Simula是不完全相同的，他认为对象间只能通过消息通讯，而不是方法：

I thought of objects being like biological cells and/or individual computers on a network, only able to communicate with messages (so messaging came at the very beginning – it took a while to see how to do messaging in a programming language efficiently enough to be useful). Alan Kay

消息跟方法有什么不同？方法本质上还是函数，只是第一个参数写死为对象地址，所以不能调用动态地往对象里加方法。而消息是让对象执行某个逻辑的请求，对象收到消息后内部决定是否处理，以及如何处理消息。不管是编译时还是运行时，给对象发任何消息都是可行的。在实现上，对象内部会有一个类似消息派发中心的逻辑，专门负责处理消息。如果消息能被处理，再派发到对应的处理子过程/函数里。

1981年Brad Cox在ITT上班，开始接触Smalltalk。而他也意识到C语言面向过程的局限性，决定给C语言加上Smalltalk的特性。并于1983年，发布了支持Smalltalk对象特性的C语言预编译器：OOPC

Just a moment...

dl.acm.org

后面他们发现，用预编译器来实现面向对象的特性，局限性太大了。于是他转向支持C语言面向对象拓展的开发，并于1986年，通过Stepstone发布了支持面向对象特性的C语言——Objective-C。

1988年，乔布斯离开Apple，创办NeXT公司，开发NeXTSTEP操作系统，正苦于为NeXTSTEP寻找一门面向对象且效率高并且支持C语言的语言。乔布斯先前访问过开发Smalltalk语言的公司，Smalltalk对他产生了非常大的影响。后面他发现了Objective-C，所以一拍即合，选择Objective-C作为NeXTSTEP系统的开发语言。

后面的事情大家也知道了。20世纪末，乔布斯重返Apple，Apple收购了NeXT公司，NeXTSTEP里优秀的Cocoa库收归Apple。Brad Cox创办的Stepstone公司也于20世纪末期被Apple收购，至此Objective-C成为了Apple开发首选语言，直到2014年Swift的发布。

为什么Objective-C调用对象子过程被称为消息？因为这本来就是Objective-C诞生的原因。

初探objc_msgSend#

在上一章我们就知道，对于[objc sayHello]：

实际调用objc_msgSend(objc, SEL("sayHello")) 。
SEL("sayHello") 是伪代码，实际是一个指向sayHello 的可读区域字符串指针。
objc_msgSend的原型是

1
OBJC_EXPORT id _Nullable
2
objc_msgSend(id _Nullable self, SEL _Nonnull op, ...)
3
    OBJC_AVAILABLE(10.0, 2.0, 9.0, 1.0, 2.0);

其中：

id 传入接收消息的对象
SEL selector
op 可变参数，是消息参数，并受 method type encoding 与 ABI 约束

SEL#

我们来看一个demo：

1
#include <Foundation/Foundation.h>
2
#include <objc/message.h>
3

4
@interface MyClass: NSObject
5
@end
6

7
@implementation MyClass
8
- (void)sayHi {
9
  printf("Hi\n");
10
}
11
@end
12

13
int main() {
14
  MyClass *obj = [[MyClass alloc] init];
15
  // 1
16
  ((void (*)(id, SEL))objc_msgSend)(obj, @selector(sayHi));
17
  // 2
18
  ((void (*)(id, SEL))objc_msgSend)(obj, NSSelectorFromString(@"sayHi"));
19
  // 3 下面的代码会导致异常
20
  ((void (*)(id, SEL))objc_msgSend)(obj, "sayHi");
21
}

3出现异常，说明在Objective-C里，"sayHi" 与 @selector(sayHi) / NSSelectorFromString(@"sayHi") 并不属于同一条消息。

@selector 是什么？

从上一章我们可以知道，@selector(msg_name)实际上是从__objc_selrefs段取指向__objc_methname段里字符串为msg_name的指针。

NSSelectorFromString是什么？

我们先看结果，看下 NSSelectorFromString(@"sayHi") 返回什么：

1
#include <Foundation/Foundation.h>
2
#include <objc/message.h>
3

4
int main() {
5
  printf("%p\n", @selector(sayHi));
6
  printf("%p\n", NSSelectorFromString(@"sayHi"));
7
}
8

9
0x10244ca22
10
0x10244ca22

@selector(sayHi) == NSSelectorFromString(@"sayHi") ，这也就能说明为什么demo中1和2的执行结果一致。

并且，"sayHi" 存在字符串常量区，地址与 @selector(sayHi) / NSSelectorFromString(@"sayHi") 不同。这也能够说明对于两个selector，Objective-C是通过比对selector的地址来判断是否属于同一条消息，并不是通过简单的字符串比对判断。换句话说，SEL 的唯一性来自 sel_registerName 的**字符串驻留（intern）**逻辑。

而这种「将字符串当作SEL传入objc_msgSend」的行为，属于UB。实际编码过程万万不可这么写。😩

NSSelectorFromString 为什么会返回一个指向__objc_methname 段里的selector呢？NSSelectorFromString 并没有开源，我们写个程序打断点看看。

写一个非常简单的demo程序#

1
#include <Foundation/Foundation.h>
2
#include <objc/message.h>
3

4
int main() {
5
  void *sel = NSSelectorFromString(@"sayHi");
6
}

编译+打断点#

1
(lldb) br set -r NSSelectorFromString
2
Breakpoint 1: where = Foundation`NSSelectorFromString, address = 0x000000018ee87f20
3
(lldb) r
4
Process 6955 launched: '/Users/orangeboy/Downloads/untitled folder/test' (arm64)
5
Process 6955 stopped
6
* thread #1, queue = 'com.apple.main-thread', stop reason = breakpoint 1.1
7
    frame #0: 0x000000018ee87f20 Foundation`NSSelectorFromString
8
Foundation`NSSelectorFromString:
9
->  0x18ee87f20 <+0>:  pacibsp
10
    0x18ee87f24 <+4>:  stp    x22, x21, [sp, #-0x30]!
11
    0x18ee87f28 <+8>:  stp    x20, x19, [sp, #0x10]
12
    0x18ee87f2c <+12>: stp    x29, x30, [sp, #0x20]
13
Target 0: (test) stopped.
14
(lldb) disa
15
Foundation`NSSelectorFromString:
16
->  0x18ee87f20 <+0>:   pacibsp
17
    0x18ee87f24 <+4>:   stp    x22, x21, [sp, #-0x30]!
18
    0x18ee87f28 <+8>:   stp    x20, x19, [sp, #0x10]
19
    0x18ee87f2c <+12>:  stp    x29, x30, [sp, #0x20]
20
    0x18ee87f30 <+16>:  add    x29, sp, #0x20
21
    0x18ee87f34 <+20>:  sub    sp, sp, #0x3f0
22
    0x18ee87f38 <+24>:  adrp   x8, 402965
23
    0x18ee87f3c <+28>:  ldr    x8, [x8, #0x388]
24
    0x18ee87f40 <+32>:  ldr    x8, [x8]
25
    0x18ee87f44 <+36>:  stur   x8, [x29, #-0x28]
26
    0x18ee87f48 <+40>:  cbz    x0, 0x18ee87fc0 ; <+160>
27
    0x18ee87f4c <+44>:  mov    x19, x0
28
    0x18ee87f50 <+48>:  bl     0x18fc95e80    ; objc_msgSend$length
29
    0x18ee87f54 <+52>:  mov    x20, x0
30
    0x18ee87f58 <+56>:  mov    x2, sp
31
    0x18ee87f5c <+60>:  mov    x0, x19
32
    0x18ee87f60 <+64>:  mov    w3, #0x3e8 ; =1000
33
    0x18ee87f64 <+68>:  mov    w4, #0x4 ; =4
34
    0x18ee87f68 <+72>:  bl     0x18fc8f1c0    ; objc_msgSend$getCString:maxLength:encoding:
35
    0x18ee87f6c <+76>:  cbz    w0, 0x18ee87f88 ; <+104>
36
    0x18ee87f70 <+80>:  mov    x0, sp
37
    0x18ee87f74 <+84>:  bl     0x18f88336c    ; symbol stub for: strlen
38
    0x18ee87f78 <+88>:  cmp    x0, x20
39
    0x18ee87f7c <+92>:  b.ne   0x18ee87f88    ; <+104>
40
    0x18ee87f80 <+96>:  mov    x0, sp
41
    0x18ee87f84 <+100>: b      0x18ee87fb4    ; <+148>
42
    0x18ee87f88 <+104>: cbz    x20, 0x18ee87fac ; <+140>
43
    0x18ee87f8c <+108>: mov    x21, #0x0 ; =0
44
    0x18ee87f90 <+112>: mov    x0, x19
45
    0x18ee87f94 <+116>: mov    x2, x21
46
    0x18ee87f98 <+120>: bl     0x18fc891e0    ; objc_msgSend$characterAtIndex:
47
    0x18ee87f9c <+124>: cbz    w0, 0x18ee87fbc ; <+156>
48
    0x18ee87fa0 <+128>: add    x21, x21, #0x1
49
    0x18ee87fa4 <+132>: cmp    x20, x21
50
    0x18ee87fa8 <+136>: b.ne   0x18ee87f90    ; <+112>
51
    0x18ee87fac <+140>: mov    x0, x19
52
    0x18ee87fb0 <+144>: bl     0x18fc7b220    ; objc_msgSend$UTF8String
53
    0x18ee87fb4 <+148>: bl     0x18f88322c    ; symbol stub for: sel_registerName
54
    0x18ee87fb8 <+152>: b      0x18ee87fc0    ; <+160>
55
    0x18ee87fbc <+156>: mov    x0, #0x0 ; =0
56
    0x18ee87fc0 <+160>: ldur   x8, [x29, #-0x28]
57
    0x18ee87fc4 <+164>: adrp   x9, 402965
58
    0x18ee87fc8 <+168>: ldr    x9, [x9, #0x388]
59
    0x18ee87fcc <+172>: ldr    x9, [x9]
60
    0x18ee87fd0 <+176>: cmp    x9, x8
61
    0x18ee87fd4 <+180>: b.ne   0x18ee87fec    ; <+204>
62
    0x18ee87fd8 <+184>: add    sp, sp, #0x3f0
63
    0x18ee87fdc <+188>: ldp    x29, x30, [sp, #0x20]
64
    0x18ee87fe0 <+192>: ldp    x20, x19, [sp, #0x10]
65
    0x18ee87fe4 <+196>: ldp    x22, x21, [sp], #0x30
66
    0x18ee87fe8 <+200>: retab
67
    0x18ee87fec <+204>: bl     0x18f8811cc    ; symbol stub for: __stack_chk_fail

NSSelectorFromString 反汇编路径解读#

前半部分很简单，是把传入的NSString转为CString（实际是一个char数组）。先调用NSString的getCString:maxLength:encoding: ，如果失败就尝试调用strlen

这个CString会传递给sel_registerName 。正好 sel_registerName 是开源的，我们看看里面具体逻辑：

sel_registerName#

objc4/runtime/objc-sel.mm at fb265098298302243cd7eeaa1f63f0ba7786dd9a · apple-oss-distributions/objc4

Contribute to apple-oss-distributions/objc4 development by creating an account on GitHub.

github.com

1
static objc::ExplicitInitDenseSet<const char *> namedSelectors;
2

3
...
4

5
SEL sel_registerName(const char *name) {
6
    return __sel_registerName(name, 1, 1);     // YES lock, YES copy
7
}
8

9
...
10

11
static SEL __sel_registerName(const char *name, bool shouldLock, bool copy)
12
{
13
    SEL result = 0;
14

15
    if (shouldLock) lockdebug::assert_unlocked(&selLock.get());
16
    else            lockdebug::assert_locked(&selLock.get());
17

18
    if (!name) return (SEL)0;
19

20
    result = _sel_searchBuiltins(name);
21
    if (result) return result;
22

23
    conditional_mutex_locker_t lock(selLock, shouldLock);
24
  auto it = namedSelectors.get().insert(name);
25
  if (it.second) {
26
    // No match. Insert.
27
    *it.first = (const char *)sel_alloc(name, copy);
28
  }
29
  return (SEL)*it.first;
30
}
31

32
...
33

34
SEL _sel_searchBuiltins(const char *name)
35
{
36
#if SUPPORT_PREOPT
37
  if (SEL result = (SEL)_dyld_get_objc_selector(name))
38
    return result;
39
#endif
40
    return nil;
41
}
42

43
...
44

45
static SEL sel_alloc(const char *name, bool copy)
46
{
47
    lockdebug::assert_locked(&selLock.get());
48
    return (SEL)(copy ? strdupIfMutable(name) : name);
49
}

逻辑非常清晰也很简单，主要分为以下几步：

判断该selector有没有加载过。实际是调用 dyld 的 _dyld_get_objc_selector 尝试获取selector表，有则直接返回。
尝试往 namedSelectors 插入selector。namedSelectors 是一个 ExplicitInitDenseSetDenseSet<const char *> ，可以简单理解为一个Set，在插入时会比对字符串的值。如果存在相同的值就直接返回，否则新插入一条selector。

为什么插入的时候会比较字符串内容呢？

DenseSet#

实际上 ExplicitInitDenseSetDenseSet<const char *> 有以下继承关系：

ExplicitInitDenseSetDenseSet<const char *> ← ExplicitInit<DenseSet<const char *>>

ExplicitInit 可暂时不看，是方便初始化用的。我们先来看看 DenseSet<const char *> ，实际实现是

objc4/runtime/llvm-DenseSet.h at fb265098298302243cd7eeaa1f63f0ba7786dd9a · apple-oss-distributions/objc4

Contribute to apple-oss-distributions/objc4 development by creating an account on GitHub.

github.com

1
template <typename ValueT, typename ValueInfoT = DenseMapInfo<ValueT>>
2
class DenseSet : public detail::DenseSetImpl<
3
                     ValueT, DenseMap<ValueT, detail::DenseSetEmpty,
4
                                      DenseMapValueInfo<detail::DenseSetEmpty>,
5
                                      ValueInfoT, detail::DenseSetPair<ValueT>>,
6
                     ValueInfoT> {
7
  using BaseT =
8
      detail::DenseSetImpl<ValueT,
9
                           DenseMap<ValueT, detail::DenseSetEmpty,
10
                                    DenseMapValueInfo<detail::DenseSetEmpty>,
11
                                    ValueInfoT, detail::DenseSetPair<ValueT>>,
12
                           ValueInfoT>;
13

14
public:
15
  using BaseT::BaseT;
16
};

上面的ValueT = const char * ，那么 DenseSetImpl 是什么呢

1
/// Base class for DenseSet and DenseSmallSet.
2
///
3
/// MapTy should be either
4
///
5
///   DenseMap<ValueT, detail::DenseSetEmpty,
6
///            DenseMapValueInfo<detail::DenseSetEmpty>,
7
///            ValueInfoT, detail::DenseSetPair<ValueT>>
8
///
9
/// or the equivalent SmallDenseMap type.  ValueInfoT must implement the
10
/// DenseMapInfo "concept".
11
template <typename ValueT, typename MapTy, typename ValueInfoT>
12
class DenseSetImpl {
13
  static_assert(sizeof(typename MapTy::value_type) == sizeof(ValueT),
14
                "DenseMap buckets unexpectedly large!");
15
  MapTy TheMap;
16

17
  template <typename T>
18
  using const_arg_type_t = typename const_pointer_or_const_ref<T>::type;
19

20
public:
21
  using key_type = ValueT;
22
  using value_type = ValueT;
23
  using size_type = unsigned;
24

25
...
26

27
std::pair<iterator, bool> insert(const ValueT &V) {
28
    detail::DenseSetEmpty Empty;
29
    return TheMap.try_emplace(V, Empty);
30
  }
31

32
  std::pair<iterator, bool> insert(ValueT &&V) {
33
    detail::DenseSetEmpty Empty;
34
    return TheMap.try_emplace(std::move(V), Empty);
35
  }
36
...

可见，DenseSet 实际是一个Key为实际值（这里是const char *），Value为空的 DenseMap 。DenseMap 会通过template类型调用 DenseSet 的 insert ，实际是调用 DenseMap 的 try_emplace 。

DenseMap#

DenseMap 的原型如下：

objc4/runtime/llvm-DenseMap.h at fb265098298302243cd7eeaa1f63f0ba7786dd9a · apple-oss-distributions/objc4

Contribute to apple-oss-distributions/objc4 development by creating an account on GitHub.

github.com

1
template <typename KeyT, typename ValueT,
2
          typename ValueInfoT = DenseMapValueInfo<ValueT>,
3
          typename KeyInfoT = DenseMapInfo<KeyT>,
4
          typename BucketT = detail::DenseMapPair<KeyT, ValueT>>
5
class DenseMap : public DenseMapBase<DenseMap<KeyT, ValueT, ValueInfoT, KeyInfoT, BucketT>,
6
                                     KeyT, ValueT, ValueInfoT, KeyInfoT, BucketT> {
7
  friend class DenseMapBase<DenseMap, KeyT, ValueT, ValueInfoT, KeyInfoT, BucketT>;
8
  ...

结构有点复杂，但可以注意到，DenseMap 是通过 KeyInfoT = DenseMapInfo<KeyT> 来获取Key的信息的，比如Key的哈希值、两个Key是否相等等。而 DenseMapInfo<const char *> 是什么呢？

objc4/runtime/llvm-DenseMapInfo.h at fb265098298302243cd7eeaa1f63f0ba7786dd9a · apple-oss-distributions/objc4

Contribute to apple-oss-distributions/objc4 development by creating an account on GitHub.

github.com

1
// Provide DenseMapInfo for cstrings.
2
template<> struct DenseMapInfo<const char*> {
3
  static inline const char* getEmptyKey() {
4
    return reinterpret_cast<const char *>((intptr_t)-1);
5
  }
6
  static inline const char* getTombstoneKey() {
7
    return reinterpret_cast<const char *>((intptr_t)-2);
8
  }
9
  static unsigned getHashValue(const char* const &Val) {
10
    return _objc_strhash(Val);
11
  }
12
  static bool isEqual(const char* const &LHS, const char* const &RHS) {
13
    if (LHS == RHS) {
14
      return true;
15
    }
16
    if (LHS == getEmptyKey() || RHS == getEmptyKey()) {
17
      return false;
18
    }
19
    if (LHS == getTombstoneKey() || RHS == getTombstoneKey()) {
20
      return false;
21
    }
22
    return 0 == strcmp(LHS, RHS);
23
  }
24
};
25

26
// objc-private.h
27
static __inline uint32_t _objc_strhash(const char *s) {
28
    uint32_t hash = 0;
29
    for (;;) {
30
    int a = *s++;
31
    if (0 == a) break;
32
    hash += (hash << 8) + a;
33
    }
34
    return hash;
35
}

注意 getHashValue 和 isEqual 两个方法，说明 DenseSet<const char *> 是通过字符串本身计算哈希值。所以有两个值相同，但地址不同的字符串存入 DenseSet<const char*> ，最后只会存一份字符串（是否共享字符串内存取决于 copy 参数与是否 strdupIfMutable）。这也能够说明为什么同一个字符串只能对应一个selector。

select进入namedSelectors的时机#

selector sayHi是什么时候插入进namedSelectors 的？ namedSelectors 是什么时候初始化的？

通过观察，我们可以看到存在：

1
map_images → map_images_nolock → sel_init
2
                         ↓
3
                     _read_images → sel_registerNameNoLock → namedSelectors

这条调用链。

objc4/runtime/objc-sel.mm at fb265098298302243cd7eeaa1f63f0ba7786dd9a · apple-oss-distributions/objc4

Contribute to apple-oss-distributions/objc4 development by creating an account on GitHub.

github.com

1
/***********************************************************************
2
* sel_init
3
* Initialize selector tables and register selectors used internally.
4
**********************************************************************/
5
void sel_init(size_t selrefCount)
6
{
7
#if SUPPORT_PREOPT
8
    if (PrintPreopt) {
9
        _objc_inform("PREOPTIMIZATION: using dyld selector opt");
10
    }
11
#endif
12

13
  namedSelectors.init((unsigned)selrefCount);
14

15
    // Register selectors used by libobjc
16

17
    mutex_locker_t lock(selLock);
18

19
    SEL_cxx_construct = sel_registerNameNoLock(".cxx_construct", NO);
20
    SEL_cxx_destruct = sel_registerNameNoLock(".cxx_destruct", NO);
21
}

sel_init 是谁调用的？

objc4/runtime/objc-os.mm at fb265098298302243cd7eeaa1f63f0ba7786dd9a · apple-oss-distributions/objc4

Contribute to apple-oss-distributions/objc4 development by creating an account on GitHub.

github.com

1
void
2
map_images_nolock(unsigned mhCount, const struct _dyld_objc_notify_mapped_info infos[],
3
                  bool *disabledClassROEnforcement,
4
                  _dyld_objc_mark_image_mutable makeImageMutable)
5
{
6
  ...
7
  if (firstTime) {
8
        sel_init(selrefCount);
9
        ...
10
}

而map_images_nolock是谁调用的呢？

objc4/runtime/objc-runtime-new.mm at fb265098298302243cd7eeaa1f63f0ba7786dd9a · apple-oss-distributions/objc4

Contribute to apple-oss-distributions/objc4 development by creating an account on GitHub.

github.com

1
/***********************************************************************
2
* map_images
3
* Process the given images which are being mapped in by dyld.
4
* Calls ABI-agnostic code after taking ABI-specific locks.
5
*
6
* Locking: write-locks runtimeLock
7
**********************************************************************/
8
void
9
map_images(unsigned count, const struct _dyld_objc_notify_mapped_info infos[],
10
           _dyld_objc_mark_image_mutable makeImageMutable)
11
{
12
    bool takeEnforcementDisableFault;
13

14
    {
15
        mutex_locker_t lock(runtimeLock);
16
        map_images_nolock(count, infos, &takeEnforcementDisableFault, makeImageMutable);
17
    }
18

19
    if (takeEnforcementDisableFault) {
20
        if (DebugClassRXSigning == Fatal)
21
            _objc_fatal("class_rx signing mismatch");
22

23
#if TARGET_OS_IPHONE && !TARGET_OS_SIMULATOR
24
        if (!DisableClassROFaults)
25
            _objc_fault("class_ro_t enforcement disabled");
26
#endif
27
    }
28
}

map_images_nolock 由 map_images 调用。上一章也提到，map_images 在动态库载入时会被调用，所以在程序初始化时就能够完成 namedSelectors 的初始化。

初始化的流程我们理解了，但是问题还没解决：映像 __objc_selrefs 段里存的selector什么时候转到namedSelectors 里呢？

上面的代码可以看到，map_images 会调用 map_images_nolock 。而在 map_images_nolock 里会调用一个函数 _read_images

1
void
2
map_images_nolock(unsigned mhCount, const struct _dyld_objc_notify_mapped_info infos[],
3
                  bool *disabledClassROEnforcement,
4
                  _dyld_objc_mark_image_mutable makeImageMutable)
5
{
6
...
7
      if (hCount > 0) {
8
          _read_images(mappedInfos, hCount, totalClasses, unoptimizedTotalClasses, makeImageMutable);
9
      }
10
}

我们看看里面的作用

::url-card={url=“https://github.com/apple-oss-distributions/objc4/blob/fb265098298302243cd7eeaa1f63f0ba7786dd9a/runtime/objc-runtime-new.mm”}

1
/***********************************************************************
2
* _read_images
3
* Perform initial processing of the headers in the linked
4
* list beginning with headerList.
5
*
6
* Called by: map_images_nolock
7
*
8
* Locking: runtimeLock acquired by map_images
9
**********************************************************************/
10
void _read_images(mapped_image_info infosParam[], uint32_t hCount, int totalClasses, int unoptimizedTotalClasses,
11
                  _dyld_objc_mark_image_mutable makeImageMutable)
12
{
13
  ...
14
 static size_t UnfixedSelectors;
15
    {
16
        mutex_locker_t lock(selLock);
17
        for (auto& info : infos) {
18
            if (info.dyldObjCRefsOptimized()) continue;
19

20
            bool isBundle = info.hi->isBundle();
21
            SEL *sels = info.hi->selrefs(&count);
22
            UnfixedSelectors += count;
23
            for (i = 0; i < count; i++) {
24
                const char *name = sel_cname(sels[i]);
25
                SEL sel = sel_registerNameNoLock(name, isBundle);
26
                if (sels[i] != sel) {
27
                    // The infos array is reversed, but dyld expects the original index
28
                    const uint32_t infoIndex = (hCount - 1) - infos.index(&info);
29

30
                    makeImageMutable(infoIndex);
31
                    withMutableSharedCache(info.tproEnabled(), [&] {
32
                        sels[i] = sel;
33
                    });
34
                }
35
            }
36
        }
37
    }
38
    ...
39
}

这里调用了 sel_registerNameNoLock ，实际就是将映像里的selector一个个地存进namedSelectors 里。所以能够说明：

@selector(...) 与 NSSelectorFromString(...) 会得到同一个 selector
因为 selector 已在加载期完成注册/驻留

发送消息#

发送消息部分，就是Objective-C的精髓。

猜测#

已知，对象的isa里存着baseMethods ，实际是一个数组，每一项是{SEL & type（传参类型） & 跳转地址 }。所以我们可以猜测：

每次调用objc_msgSend，都会去对象的 baseMethods 里查跳转地址并执行跳转。

但是 baseMethods 是一个数组，每调一次 objc_msgSend 都需要去遍历数组查找实现，能不能把 baseMethods 存在一个map里，这样查找的时间复杂度就下来了？所以：

每个isa里存在一个map缓存，key为selector。调用 objc_msgSend 时会先去这个缓存查找，如果没找到再去 baseMethods 里查找。

确实，Apple也是这么做的。

深入汇编#

因为objc_msgSend 的调用频次很高。Apple为了提升效率，特意用汇编来实现，属实良心。

不同CPU架构下的汇编指令也会不同。Apple甚至对不同的CPU做了指令差异处理，太良心了。

不过，核心逻辑是大致相同的。我们这里以arm64架构为例，先来看看 objc_msgSend 的具体实现：

objc4/runtime/Messengers.subproj/objc-msg-arm64.s at fb265098298302243cd7eeaa1f63f0ba7786dd9a · apple-oss-distributions/objc4

Contribute to apple-oss-distributions/objc4 development by creating an account on GitHub.

github.com

1
  MSG_ENTRY _objc_msgSend
2
  UNWIND _objc_msgSend, NoFrame
3

4
  cmp  p0, #0      // nil check and tagged pointer check
5
#if SUPPORT_TAGGED_POINTERS
6
  b.le  LNilOrTagged    //  (MSB tagged pointer looks negative)
7
#else
8
  b.eq  LReturnZero
9
#endif
10
  ldr  p14, [x0]    // p14 = raw isa
11
  GetClassFromIsa_p16 p14, 1, x0  // p16 = class
12
LGetIsaDone:
13
  // calls imp or objc_msgSend_uncached
14
  CacheLookup NORMAL, _objc_msgSend, __objc_msgSend_uncached

objc_msgSend 的关键步骤可以概括为：

nil / tagged pointer 检查
从对象取出 isa
在 class cache 中查找 selector → IMP
命中则跳转；未命中则进入慢路径

我让G老师写了一段伪代码，方便理解：

1
IMP objc_msgSend(id receiver, SEL sel, ...) {
2
    if (receiver == nil) return 0;
3

4
    cls = decode_isa(receiver->isa);
5
    imp = cache_lookup(cls, sel);
6

7
    if (imp == NULL) {
8
        imp = __objc_msgSend_uncached(cls, sel);
9
    }
10

11
    return imp(receiver, sel, ...);
12
}

LNilOrTagged#

LNilOrTagged 是什么呢？

1
#if SUPPORT_TAGGED_POINTERS
2
LNilOrTagged:
3
  b.eq  LReturnZero    // nil check
4
  GetTaggedClass
5
  b  LGetIsaDone
6
// SUPPORT_TAGGED_POINTERS
7
#endif
8

9
LReturnZero:
10
  // x0 is already zero
11
  mov  x1, #0
12
  movi  d0, #0
13
  movi  d1, #0
14
  movi  d2, #0
15
  movi  d3, #0
16
  ret
17

18
  END_ENTRY _objc_msgSend

其实就是：

如果传入的对象为nil，就直接跳到LReturnZero
否则解析Tagged Pointer拿到class 指针，存在x16

这里也能够说明，为什么Objective-C里能够对空指针发消息。

拿到了isa，就到了缓存查找部分：CacheLookup

CacheLookup#

缓存是什么？回顾上一章，我们复习一下isa 的结构：

objc4/runtime/objc-runtime-new.h at fb265098298302243cd7eeaa1f63f0ba7786dd9a · apple-oss-distributions/objc4

Contribute to apple-oss-distributions/objc4 development by creating an account on GitHub.

github.com

1
struct objc_class : objc_object {
2
  objc_class(const objc_class&) = delete;
3
  objc_class(objc_class&&) = delete;
4
  void operator=(const objc_class&) = delete;
5
  void operator=(objc_class&&) = delete;
6
    // Class ISA;
7
    Class superclass;
8
    cache_t cache;             // formerly cache pointer and vtable
9
    class_data_bits_t bits;    // class_rw_t * plus custom rr/alloc flags
10
    ...

cache_t是什么？

objc4/runtime/objc-runtime-new.h at fb265098298302243cd7eeaa1f63f0ba7786dd9a · apple-oss-distributions/objc4

Contribute to apple-oss-distributions/objc4 development by creating an account on GitHub.

github.com

1
struct cache_t {
2
private:
3
    explicit_atomic<uintptr_t> _bucketsAndMaybeMask;
4
    union {
5
        // Note: _flags on ARM64 needs to line up with the unused bits of
6
        // _originalPreoptCache because we access some flags (specifically
7
        // FAST_CACHE_HAS_DEFAULT_CORE and FAST_CACHE_HAS_DEFAULT_AWZ) on
8
        // unrealized classes with the assumption that they will start out
9
        // as 0.
10
        struct {
11
#if CACHE_MASK_STORAGE == CACHE_MASK_STORAGE_OUTLINED && !__LP64__
12
            // Outlined cache mask storage, 32-bit, we have mask and occupied.
13
            explicit_atomic<mask_t>    _mask;
14
            uint16_t                   _occupied;
15
#elif CACHE_MASK_STORAGE == CACHE_MASK_STORAGE_OUTLINED && __LP64__
16
            // Outlined cache mask storage, 64-bit, we have mask, occupied, flags.
17
            explicit_atomic<mask_t>    _mask;
18
            uint16_t                   _occupied;
19
            uint16_t                   _flags;
20
#   define CACHE_T_HAS_FLAGS 1
21
#elif __LP64__
22
            // Inline cache mask storage, 64-bit, we have occupied, flags, and
23
            // empty space to line up flags with originalPreoptCache.
24
            //
25
            // Note: the assembly code for objc_release_xN knows about the
26
            // location of _flags and the
27
            // FAST_CACHE_HAS_CUSTOM_DEALLOC_INITIATION flag within. Any changes
28
            // must be applied there as well.
29
            uint32_t                   _disguisedPreoptCacheSignature;
30
            uint16_t                   _occupied;
31
            uint16_t                   _flags;
32
#   define CACHE_T_HAS_FLAGS 1
33
#else
34
            // Inline cache mask storage, 32-bit, we have occupied, flags.
35
            uint16_t                   _occupied;
36
            uint16_t                   _flags;
37
#   define CACHE_T_HAS_FLAGS 1
38
#endif
39

40
        };
41
        explicit_atomic<preopt_cache_t *, PTRAUTH_STR(originalPreoptCache, ptrauth_key_process_independent_data)> _originalPreoptCache;
42
    };

_bucketsAndMaybeMask 是什么呢？我们上一章讲过，指针被mask是为了安全。其实，这里buckets的实际内存布局是：

1
[ bucket0 ][ bucket1 ][ bucket2 ][ bucket3 ] ...

bucket是什么呢？

1
struct bucket_t {
2
private:
3
    // IMP-first is better for arm64e ptrauth and no worse for arm64.
4
    // SEL-first is better for armv7* and i386 and x86_64.
5
#if __arm64__
6
    explicit_atomic<uintptr_t> _imp;
7
    explicit_atomic<SEL> _sel;
8
#else
9
    explicit_atomic<SEL> _sel;
10
    explicit_atomic<uintptr_t> _imp;
11
#endif

说白了就是imp+sel的紧凑结构。

现在先来一个思考题，已知isa的地址，怎么获取bucket的基地？

1
// 1️⃣ decode isa（mask 取 class pointer）
2
Class cls = raw_isa & ISA_MASK;
3

4
// 2️⃣ cache_t 在 class 内的偏移
5
cache_t *cache = (cache_t *)((uint8_t*)cls + CACHE_OFFSET);
6

7
// 3️⃣ buckets 在 cache_t 内的偏移
8
bucket_t *buckets = *(bucket_t **)((uint8_t*)cache + BUCKETS_OFFSET);

并且：

CACHE_OFFSET = 0x10
BUCKETS_OFFSET = 0x00

我们开始看CacheLookUp的汇编代码

1
/********************************************************************
2
 *
3
 * CacheLookup NORMAL|GETIMP|LOOKUP <function> MissLabelDynamic MissLabelConstant
4
 *
5
 * MissLabelConstant is only used for the GETIMP variant.
6
 *
7
 * Locate the implementation for a selector in a class method cache.
8
 *
9
 * When this is used in a function that doesn't hold the runtime lock,
10
 * this represents the critical section that may access dead memory.
11
 * If the kernel causes one of these functions to go down the recovery
12
 * path, we pretend the lookup failed by jumping the JumpMiss branch.
13
 *
14
 * Takes:
15
 *   x1 = selector
16
 *   x16 = class to be searched
17
 *
18
 * Kills:
19
 *    x9,x10,x11,x12,x13,x15,x17
20
 *
21
 * Untouched:
22
 *    x14
23
 *
24
 * On exit: (found) calls or returns IMP
25
 *                  with x16 = class, x17 = IMP
26
 *                  In LOOKUP mode, the two low bits are set to 0x3
27
 *                  if we hit a constant cache (used in objc_trace)
28
 *          (not found) jumps to LCacheMiss
29
 *                  with x15 = class
30
 *                  For constant caches in LOOKUP mode, the low bit
31
 *                  of x16 is set to 0x1 to indicate we had to fallback.
32
 *          In addition, when LCacheMiss is __objc_msgSend_uncached or
33
 *          __objc_msgLookup_uncached, 0x2 will be set in x16
34
 *          to remember we took the slowpath.
35
 *          So the two low bits of x16 on exit mean:
36
 *            0: dynamic hit
37
 *            1: fallback to the parent class, when there is a preoptimized cache
38
 *            2: slowpath
39
 *            3: preoptimized cache hit
40
 *
41
 ********************************************************************/
42

43
#define NORMAL 0
44
#define GETIMP 1
45
#define LOOKUP 2
46

47
// CacheHit: x17 = cached IMP, x10 = address of buckets, x1 = SEL, x16 = isa
48
.macro CacheHit
49
.if $0 == NORMAL
50
  TailCallCachedImp x17, x10, x1, x16  // authenticate and call imp
51
.elseif $0 == GETIMP
52
  mov  p0, p17
53
  cbz  p0, 9f          // don't ptrauth a nil imp
54
  AuthAndResignAsIMP x0, x10, x1, x16, x17  // authenticate imp and re-sign as IMP
55
9:  ret            // return IMP
56
.elseif $0 == LOOKUP
57
  // No nil check for ptrauth: the caller would crash anyway when they
58
  // jump to a nil IMP. We don't care if that jump also fails ptrauth.
59
  AuthAndResignAsIMP x17, x10, x1, x16, x10  // authenticate imp and re-sign as IMP
60
  cmp  x16, x15
61
  cinc  x16, x16, ne      // x16 += 1 when x15 != x16 (for instrumentation ; fallback to the parent class)
62
  ret        // return imp via x17
63
.else
64
.abort oops
65
.endif
66
.endmacro
67

68
.macro CacheLookup Mode, Function, MissLabelDynamic, MissLabelConstant
69
  //
70
  // Restart protocol:
71
  //
72
  //   As soon as we're past the LLookupStart\Function label we may have
73
  //   loaded an invalid cache pointer or mask.
74
  //
75
  //   When task_restartable_ranges_synchronize() is called,
76
  //   (or when a signal hits us) before we're past LLookupEnd\Function,
77
  //   then our PC will be reset to LLookupRecover\Function which forcefully
78
  //   jumps to the cache-miss codepath which have the following
79
  //   requirements:
80
  //
81
  //   GETIMP:
82
  //     The cache-miss is just returning NULL (setting x0 to 0)
83
  //
84
  //   NORMAL and LOOKUP:
85
  //   - x0 contains the receiver
86
  //   - x1 contains the selector
87
  //   - x16 contains the isa
88
  //   - other registers are set as per calling conventions
89
  //
90

91
  mov  x15, x16      // stash the original isa
92
LLookupStart\Function:
93
  // p1 = SEL, p16 = isa
94
#if CACHE_MASK_STORAGE == CACHE_MASK_STORAGE_HIGH_16_BIG_ADDRS
95
  ldr  p10, [x16, #CACHE]        // p10 = mask|buckets
96
  lsr  p11, p10, #48      // p11 = mask
97
  and  p10, p10, #0xffffffffffff  // p10 = buckets
98
  and  w12, w1, w11      // x12 = _cmd & mask
99
#elif CACHE_MASK_STORAGE == CACHE_MASK_STORAGE_HIGH_16
100
  ldr  p11, [x16, #CACHE]      // p11 = mask|buckets
101
#if CONFIG_USE_PREOPT_CACHES
102
#if __has_feature(ptrauth_calls)
103
  tbnz  p11, #0, LLookupPreopt\Function
104
  and  p10, p11, #0x0000ffffffffffff  // p10 = buckets
105
#else
106
  and  p10, p11, #0x0000fffffffffffe  // p10 = buckets
107
  tbnz  p11, #0, LLookupPreopt\Function
108
#endif
109
  eor  p12, p1, p1, LSR #7
110
  and  p12, p12, p11, LSR #48    // x12 = (_cmd ^ (_cmd >> 7)) & mask
111
#else
112
  and  p10, p11, #0x0000ffffffffffff  // p10 = buckets
113
  and  p12, p1, p11, LSR #48    // x12 = _cmd & mask
114
#endif // CONFIG_USE_PREOPT_CACHES
115
#elif CACHE_MASK_STORAGE == CACHE_MASK_STORAGE_LOW_4
116
  ldr  p11, [x16, #CACHE]        // p11 = mask|buckets
117
  and  p10, p11, #~0xf      // p10 = buckets
118
  and  p11, p11, #0xf      // p11 = maskShift
119
  mov  p12, #0xffff
120
  lsr  p11, p12, p11      // p11 = mask = 0xffff >> p11
121
  and  p12, p1, p11      // x12 = _cmd & mask
122
#else
123
#error Unsupported cache mask storage for ARM64.
124
#endif
125

126
  add  p13, p10, p12, LSL #(1+PTRSHIFT)
127
            // p13 = buckets + ((_cmd & mask) << (1+PTRSHIFT))
128

129
            // do {
130
1:  ldp  p17, p9, [x13], #-BUCKET_SIZE  //     {imp, sel} = *bucket--
131
  cmp  p9, p1        //     if (sel != _cmd) {
132
  b.ne  3f        //         scan more
133
            //     } else {
134
2:  CacheHit \Mode        // hit:    call or return imp
135
            //     }
136
3:  cbz  p9, \MissLabelDynamic    //     if (sel == 0) goto Miss;
137
  cmp  p13, p10      // } while (bucket >= buckets)
138
  b.hs  1b
139

140
  // wrap-around:
141
  //   p10 = first bucket
142
  //   p11 = mask (and maybe other bits on LP64)
143
  //   p12 = _cmd & mask
144
  //
145
  // A full cache can happen with CACHE_ALLOW_FULL_UTILIZATION.
146
  // So stop when we circle back to the first probed bucket
147
  // rather than when hitting the first bucket again.
148
  //
149
  // Note that we might probe the initial bucket twice
150
  // when the first probed slot is the last entry.
151

152
#if CACHE_MASK_STORAGE == CACHE_MASK_STORAGE_HIGH_16_BIG_ADDRS
153
  add  p13, p10, w11, UXTW #(1+PTRSHIFT)
154
            // p13 = buckets + (mask << 1+PTRSHIFT)
155
#elif CACHE_MASK_STORAGE == CACHE_MASK_STORAGE_HIGH_16
156
  add  p13, p10, p11, LSR #(48 - (1+PTRSHIFT))
157
            // p13 = buckets + (mask << 1+PTRSHIFT)
158
            // see comment about maskZeroBits
159
#elif CACHE_MASK_STORAGE == CACHE_MASK_STORAGE_LOW_4
160
  add  p13, p10, p11, LSL #(1+PTRSHIFT)
161
            // p13 = buckets + (mask << 1+PTRSHIFT)
162
#else
163
#error Unsupported cache mask storage for ARM64.
164
#endif
165
  add  p12, p10, p12, LSL #(1+PTRSHIFT)
166
            // p12 = first probed bucket
167

168
            // do {
169
4:  ldp  p17, p9, [x13], #-BUCKET_SIZE  //     {imp, sel} = *bucket--
170
  cmp  p9, p1        //     if (sel == _cmd)
171
  b.eq  2b        //         goto hit
172
  cmp  p9, #0        // } while (sel != 0 &&
173
  ccmp  p13, p12, #0, ne    //     bucket > first_probed)
174
  b.hi  4b
175

176
LLookupEnd\Function:
177
LLookupRecover\Function:
178
  b  \MissLabelDynamic
179

180
#if CONFIG_USE_PREOPT_CACHES
181
#if CACHE_MASK_STORAGE != CACHE_MASK_STORAGE_HIGH_16
182
#error config unsupported
183
#endif
184
LLookupPreopt\Function:
185
#if __has_feature(ptrauth_calls)
186
  and  p10, p11, #0x007ffffffffffffe  // p10 = buckets
187
  autdb  x10, x16      // auth as early as possible
188
#endif
189

190
  // x12 = (_cmd - first_shared_cache_sel)
191
  adrp  x9, _MagicSelRef@PAGE
192
  ldr  p9, [x9, _MagicSelRef@PAGEOFF]
193
  sub  p12, p1, p9
194

195
  // w9  = ((_cmd - first_shared_cache_sel) >> hash_shift & hash_mask)
196
#if __has_feature(ptrauth_calls)
197
  // bits 63..60 of x11 are the number of bits in hash_mask
198
  // bits 59..55 of x11 is hash_shift
199

200
  lsr  x17, x11, #55      // w17 = (hash_shift, ...)
201
  lsr  w9, w12, w17      // >>= shift
202

203
  lsr  x17, x11, #60      // w17 = mask_bits
204
  mov  x11, #0x7fff
205
  lsr  x11, x11, x17      // p11 = mask (0x7fff >> mask_bits)
206
  and  x9, x9, x11      // &= mask
207
#else
208
  // bits 63..53 of x11 is hash_mask
209
  // bits 52..48 of x11 is hash_shift
210
  lsr  x17, x11, #48      // w17 = (hash_shift, hash_mask)
211
  lsr  w9, w12, w17      // >>= shift
212
  and  x9, x9, x11, LSR #53    // &=  mask
213
#endif
214

215
  // sel_offs is 26 bits because it needs to address a 64 MB buffer (~ 20 MB as of writing)
216
  // keep the remaining 38 bits for the IMP offset, which may need to reach
217
  // across the shared cache. This offset needs to be shifted << 2. We did this
218
  // to give it even more reach, given the alignment of source (the class data)
219
  // and destination (the IMP)
220
  ldr  x17, [x10, x9, LSL #3]    // x17 == (sel_offs << 38) | imp_offs
221
  cmp  x12, x17, LSR #38
222

223
.if \Mode == GETIMP
224
  b.ne  \MissLabelConstant    // cache miss
225
  sbfiz x17, x17, #2, #38         // imp_offs = combined_imp_and_sel[0..37] << 2
226
  sub  x0, x16, x17            // imp = isa - imp_offs
227
  SignAsImp x0, x17
228
  ret
229
.else
230
  b.ne  5f                // cache miss
231
  sbfiz x17, x17, #2, #38         // imp_offs = combined_imp_and_sel[0..37] << 2
232
  sub x17, x16, x17               // imp = isa - imp_offs
233
.if \Mode == NORMAL
234
  br  x17
235
.elseif \Mode == LOOKUP
236
  orr x16, x16, #3 // for instrumentation, note that we hit a constant cache
237
  SignAsImp x17, x10
238
  ret
239
.else
240
.abort  unhandled mode \Mode
241
.endif
242

243
5:  ldur  x9, [x10, #-16]      // offset -16 is the fallback offset
244
  add  x16, x16, x9      // compute the fallback isa
245
  b  LLookupStart\Function    // lookup again with a new isa
246
.endif
247
#endif // CONFIG_USE_PREOPT_CACHES
248

249
.endmacro

代码非常多。其实，这个代码一共分成三个阶段：

准备数据#

1
  mov  x15, x16      // stash the original isa
2
LLookupStart\Function:
3
  // p1 = SEL, p16 = isa
4
#if CACHE_MASK_STORAGE == CACHE_MASK_STORAGE_HIGH_16_BIG_ADDRS
5
  ldr  p10, [x16, #CACHE]        // p10 = mask|buckets
6
  lsr  p11, p10, #48      // p11 = mask
7
  and  p10, p10, #0xffffffffffff  // p10 = buckets
8
  and  w12, w1, w11      // x12 = _cmd & mask
9
#elif CACHE_MASK_STORAGE == CACHE_MASK_STORAGE_HIGH_16
10
  ldr  p11, [x16, #CACHE]      // p11 = mask|buckets
11
#if CONFIG_USE_PREOPT_CACHES
12
#if __has_feature(ptrauth_calls)
13
  tbnz  p11, #0, LLookupPreopt\Function
14
  and  p10, p11, #0x0000ffffffffffff  // p10 = buckets
15
#else
16
  and  p10, p11, #0x0000fffffffffffe  // p10 = buckets
17
  tbnz  p11, #0, LLookupPreopt\Function
18
#endif
19
  eor  p12, p1, p1, LSR #7
20
  and  p12, p12, p11, LSR #48    // x12 = (_cmd ^ (_cmd >> 7)) & mask
21
#else
22
  and  p10, p11, #0x0000ffffffffffff  // p10 = buckets
23
  and  p12, p1, p11, LSR #48    // x12 = _cmd & mask
24
#endif // CONFIG_USE_PREOPT_CACHES
25
#elif CACHE_MASK_STORAGE == CACHE_MASK_STORAGE_LOW_4
26
  ldr  p11, [x16, #CACHE]        // p11 = mask|buckets
27
  and  p10, p11, #~0xf      // p10 = buckets
28
  and  p11, p11, #0xf      // p11 = maskShift
29
  mov  p12, #0xffff
30
  lsr  p11, p12, p11      // p11 = mask = 0xffff >> p11
31
  and  p12, p1, p11      // x12 = _cmd & mask
32
#else
33
#error Unsupported cache mask storage for ARM64.
34
#endif

首先，外部固定会传入：

x1: sel
x16: isa

接着，会从isa+${#CACHE}取值并存到x10里。而CACHE是什么呢？

1
#define CACHE            (2 * __SIZEOF_POINTER__)

所以，x10实际是buckets

再者，x12 = _cmd & mask。mask是什么？mask=缓存的大小-1。所以，x12就是sel计算后的缓存索引位置。用hashmap的话来讲，是哈希值。

1
[bucket0][bucket1][bucket2][bucket3][bucket4][bucket5][bucket6][bucket7]
2
                                   ^
3
                                   │
4
                                  x12   ← (_cmd & mask) 得到的 index

从中往前遍历#

1
  add  p13, p10, p12, LSL #(1+PTRSHIFT)
2
            // p13 = buckets + ((_cmd & mask) << (1+PTRSHIFT))
3

4
            // do {
5
1:  ldp  p17, p9, [x13], #-BUCKET_SIZE  //     {imp, sel} = *bucket--
6
  cmp  p9, p1        //     if (sel != _cmd) {
7
  b.ne  3f        //         scan more
8
            //     } else {
9
2:  CacheHit \Mode        // hit:    call or return imp
10
            //     }
11
3:  cbz  p9, \MissLabelDynamic    //     if (sel == 0) goto Miss;
12
  cmp  p13, p10      // } while (bucket >= buckets)
13
  b.hs  1b

从bucket[x13]遍历到bucket[0]。如果找到sel==cmd，则跳转到CacheHit （传入的参数），否则x13--

该轮遍历结束后，x13位置：

1
[bucket0][bucket1][bucket2][bucket3][bucket4][bucket5][bucket6][bucket7]
2
     ^
3
     │
4
    x13

从后往中遍历#

1
  // wrap-around:
2
  //   p10 = first bucket
3
  //   p11 = mask (and maybe other bits on LP64)
4
  //   p12 = _cmd & mask
5
  //
6
  // A full cache can happen with CACHE_ALLOW_FULL_UTILIZATION.
7
  // So stop when we circle back to the first probed bucket
8
  // rather than when hitting the first bucket again.
9
  //
10
  // Note that we might probe the initial bucket twice
11
  // when the first probed slot is the last entry.
12

13

14
#if CACHE_MASK_STORAGE == CACHE_MASK_STORAGE_HIGH_16_BIG_ADDRS
15
  add  p13, p10, w11, UXTW #(1+PTRSHIFT)
16
            // p13 = buckets + (mask << 1+PTRSHIFT)
17
#elif CACHE_MASK_STORAGE == CACHE_MASK_STORAGE_HIGH_16
18
  add  p13, p10, p11, LSR #(48 - (1+PTRSHIFT))
19
            // p13 = buckets + (mask << 1+PTRSHIFT)
20
            // see comment about maskZeroBits
21
#elif CACHE_MASK_STORAGE == CACHE_MASK_STORAGE_LOW_4
22
  add  p13, p10, p11, LSL #(1+PTRSHIFT)
23
            // p13 = buckets + (mask << 1+PTRSHIFT)
24
#else
25
#error Unsupported cache mask storage for ARM64.
26
#endif
27
  add  p12, p10, p12, LSL #(1+PTRSHIFT)
28
            // p12 = first probed bucket
29

30
            // do {
31
4:  ldp  p17, p9, [x13], #-BUCKET_SIZE  //     {imp, sel} = *bucket--
32
  cmp  p9, p1        //     if (sel == _cmd)
33
  b.eq  2b        //         goto hit
34
  cmp  p9, #0        // } while (sel != 0 &&
35
  ccmp  p13, p12, #0, ne    //     bucket > first_probed)
36
  b.hi  4b

接着，x13指向buckets最后一个元素：

1
[bucket0][bucket1][bucket2][bucket3][bucket4][bucket5][bucket6][bucket7]
2
                                                                ^
3
                                                                │
4
                                                               x13

然后向前遍历，查找sel==_cmd。有就跳到CacheHit，否则x13--，直到x13==x12

结果#

如果找不到，就跳到MissLabelDynamic。对于objc_msgSend来说，是objc_msgSend_uncached

而如果跳转成功呢？CacheHit会根据传入的参数分为三种情况：

1
// CacheHit: x17 = cached IMP, x10 = address of buckets, x1 = SEL, x16 = isa
2
.macro CacheHit
3
.if $0 == NORMAL
4
  TailCallCachedImp x17, x10, x1, x16  // authenticate and call imp
5
.elseif $0 == GETIMP
6
  mov  p0, p17
7
  cbz  p0, 9f          // don't ptrauth a nil imp
8
  AuthAndResignAsIMP x0, x10, x1, x16, x17  // authenticate imp and re-sign as IMP
9
9:  ret            // return IMP
10
.elseif $0 == LOOKUP
11
  // No nil check for ptrauth: the caller would crash anyway when they
12
  // jump to a nil IMP. We don't care if that jump also fails ptrauth.
13
  AuthAndResignAsIMP x17, x10, x1, x16, x10  // authenticate imp and re-sign as IMP
14
  cmp  x16, x15
15
  cinc  x16, x16, ne      // x16 += 1 when x15 != x16 (for instrumentation ; fallback to the parent class)
16
  ret        // return imp via x17
17
.else
18
.abort oops
19
.endif
20
.endmacro

当然，objc_msgSend调用的是NORMAL类型的，实际是跳转到TailCallCachedImp。TailCallCachedImp 的作用，实际是传入imp、SEL并执行跳转。

1
macro TailCallCachedImp
2
  // $0 = cached imp, $1 = address of cached imp, $2 = SEL, $3 = isa
3
  eor  $0, $0, $3
4
.ifndef LTailCallCachedImpIndirectBranch
5
LTailCallCachedImpIndirectBranch:
6
.endif
7
  br  $0
8
.endmacro

objc_msgSend_uncached#

1
.macro MethodTableLookup
2

3
  SAVE_REGS MSGSEND
4

5
  // lookUpImpOrForward(obj, sel, cls, LOOKUP_INITIALIZE | LOOKUP_RESOLVER)
6
  // receiver and selector already in x0 and x1
7
  mov  x2, x16
8
  mov  x3, #3
9
  bl  _lookUpImpOrForward
10

11
  // IMP in x0
12
  mov  x17, x0
13

14
  RESTORE_REGS MSGSEND
15

16
.endmacro
17

18
  STATIC_ENTRY __objc_msgSend_uncached
19
  UNWIND __objc_msgSend_uncached, FrameWithNoSaves
20

21
  // THIS IS NOT A CALLABLE C FUNCTION
22
  // Out-of-band p15 is the class to search
23

24
  MethodTableLookup
25
  TailCallFunctionPointer x17
26

27
  END_ENTRY __objc_msgSend_uncached

1
.macro TailCallFunctionPointer
2
  // $0 = function pointer value
3
  br  $0
4
.endmacro

逻辑很简单，就是走到MethodTableLookup 这个宏，然后再跳到x17的地址上。

lookUpImpOrForward#

这个方法是用C写的：

objc4/runtime/objc-runtime-new.mm at fb265098298302243cd7eeaa1f63f0ba7786dd9a · apple-oss-distributions/objc4

Contribute to apple-oss-distributions/objc4 development by creating an account on GitHub.

github.com

1
NEVER_INLINE
2
IMP lookUpImpOrForward(id inst, SEL sel, Class cls, int behavior)
3
{
4
    const IMP forward_imp = (IMP)_objc_msgForward_impcache;
5
    IMP imp = nil;
6
    Class curClass;
7

8
    lockdebug::assert_unlocked(&runtimeLock.get());
9

10
    if (slowpath(!cls->isInitialized())) {
11
        // The first message sent to a class is often +new or +alloc, or +self
12
        // which goes through objc_opt_* or various optimized entry points.
13
        //
14
        // However, the class isn't realized/initialized yet at this point,
15
        // and the optimized entry points fall down through objc_msgSend,
16
        // which ends up here.
17
        //
18
        // We really want to avoid caching these, as it can cause IMP caches
19
        // to be made with a single entry forever.
20
        //
21
        // Note that this check is racy as several threads might try to
22
        // message a given class for the first time at the same time,
23
        // in which case we might cache anyway.
24
        behavior |= LOOKUP_NOCACHE;
25
    }
26

27
    // runtimeLock is held during isRealized and isInitialized checking
28
    // to prevent races against concurrent realization.
29

30
    // runtimeLock is held during method search to make
31
    // method-lookup + cache-fill atomic with respect to method addition.
32
    // Otherwise, a category could be added but ignored indefinitely because
33
    // the cache was re-filled with the old value after the cache flush on
34
    // behalf of the category.
35

36
    runtimeLock.lock();
37

38
    // We don't want people to be able to craft a binary blob that looks like
39
    // a class but really isn't one and do a CFI attack.
40
    //
41
    // To make these harder we want to make sure this is a class that was
42
    // either built into the binary or legitimately registered through
43
    // objc_duplicateClass, objc_initializeClassPair or objc_allocateClassPair.
44
    checkIsKnownClass(cls);
45

46
    cls = realizeAndInitializeIfNeeded_locked(inst, cls, behavior & LOOKUP_INITIALIZE);
47
    // runtimeLock may have been dropped but is now locked again
48
    lockdebug::assert_locked(&runtimeLock.get());
49
    curClass = cls;
50

51
    // The code used to lookup the class's cache again right after
52
    // we take the lock but for the vast majority of the cases
53
    // evidence shows this is a miss most of the time, hence a time loss.
54
    //
55
    // The only codepath calling into this without having performed some
56
    // kind of cache lookup is class_getInstanceMethod().
57

58
    // Has this class been disabled? Act like a message to nil.
59
    if (!cls || !cls->ISA()) {
60
#if __arm64__
61
        imp = _objc_returnNil;
62
        goto done;
63
#elif __x86_64
64
        if (behavior & LOOKUP_FPRET)
65
            imp = _objc_msgNil_fpret;
66
        else if (behavior & LOOKUP_FP2RET)
67
            imp = _objc_msgNil_fp2ret;
68
        else
69
            imp = _objc_msgNil;
70

71
        // We can't cache these on x86, in case some other caller tries sending
72
        // this selector with a different return type. If we con't cache then we
73
        // always come back here, and always choose the correct IMP for the
74
        // caller's expected return type.
75
        behavior |= LOOKUP_NOCACHE;
76

77
        goto done;
78
#else
79
#error Don't know how to handle messages to disabled classes on this target.
80
#endif
81
    }
82

83
    for (unsigned attempts = unreasonableClassCount();;) {
84
        if (curClass->cache.isConstantOptimizedCache(/* strict */true)) {
85
#if CONFIG_USE_PREOPT_CACHES
86
            imp = cache_getImp(curClass, sel);
87
            if (imp) goto done_unlock;
88
            curClass = curClass->cache.preoptFallbackClass();
89
#endif
90
        } else {
91
            // curClass method list.
92
            method_t *meth = getMethodNoSuper_nolock(curClass, sel);
93
            if (meth) {
94
                imp = meth->imp(false);
95
                goto done;
96
            }
97

98
            if (slowpath((curClass = curClass->getSuperclass()) == nil)) {
99
                // No implementation found, and method resolver didn't help.
100
                // Use forwarding.
101
                imp = forward_imp;
102
                break;
103
            }
104
        }
105

106
        // Halt if there is a cycle in the superclass chain.
107
        if (slowpath(--attempts == 0)) {
108
            _objc_fatal("Memory corruption in class list.");
109
        }
110

111
        // Superclass cache.
112
        imp = cache_getImp(curClass, sel);
113
        if (slowpath(imp == forward_imp)) {
114
            // Found a forward:: entry in a superclass.
115
            // Stop searching, but don't cache yet; call method
116
            // resolver for this class first.
117
            break;
118
        }
119
        if (fastpath(imp)) {
120
            // Found the method in a superclass. Cache it in this class.
121
            goto done;
122
        }
123
    }
124

125
    // No implementation found. Try method resolver once.
126

127
    if (slowpath(behavior & LOOKUP_RESOLVER)) {
128
        behavior ^= LOOKUP_RESOLVER;
129
        return resolveMethod_locked(inst, sel, cls, behavior);
130
    }
131

132
 done:
133
    if (fastpath((behavior & LOOKUP_NOCACHE) == 0)) {
134
#if CONFIG_USE_PREOPT_CACHES
135
        while (cls->cache.isConstantOptimizedCache(/* strict */true)) {
136
            cls = cls->cache.preoptFallbackClass();
137
        }
138
#endif
139
        log_and_fill_cache(cls, imp, sel, inst, curClass);
140
    }
141
#if CONFIG_USE_PREOPT_CACHES
142
 done_unlock:
143
#endif
144
    runtimeLock.unlock();
145
    if (slowpath((behavior & LOOKUP_NIL) && imp == forward_imp)) {
146
        return nil;
147
    }
148
    return imp;
149
}

其实，可以分为三部分：

类初始化。类信息可能还在只读区域里，需要把这些信息挪到可读可写的区域。
从类信息查找selector对应的函数指针IMP。
缓存SEL和IMP。

类初始化#

调用关系：

realizeAndInitializeIfNeeded_locked -> realizeClassMaybeSwiftAndLeaveLocked -> realizeClassMaybeSwiftMaybeRelock

1
/***********************************************************************
2
* realizeClassMaybeSwift (MaybeRelock / AndUnlock / AndLeaveLocked)
3
* Realize a class that might be a Swift class.
4
* Returns the real class structure for the class.
5
* Locking:
6
*   runtimeLock must be held on entry
7
*   runtimeLock may be dropped during execution
8
*   ...AndUnlock function leaves runtimeLock unlocked on exit
9
*   ...AndLeaveLocked re-acquires runtimeLock if it was dropped
10
* This complication avoids repeated lock transitions in some cases.
11
**********************************************************************/
12
static Class
13
realizeClassMaybeSwiftMaybeRelock(Class cls, mutex_t& lock, bool leaveLocked)
14
{
15
    lockdebug::assert_locked(&lock);
16

17
    if (!cls->isSwiftStable_ButAllowLegacyForNow()) {
18
        // Non-Swift class. Realize it now with the lock still held.
19
        // fixme wrong in the future for objc subclasses of swift classes
20
        cls = realizeClassWithoutSwift(cls, nil);
21
        if (!leaveLocked) lock.unlock();
22
    } else {
23
        // Swift class. We need to drop locks and call the Swift
24
        // runtime to initialize it.
25
        lock.unlock();
26
        cls = realizeSwiftClass(cls);
27
        ASSERT(cls->isRealized());    // callback must have provoked realization
28
        if (leaveLocked) lock.lock();
29
    }
30

31
    return cls;
32
}

这里分为两种情况：

对objc类初始化
对swift类初始化

对objc类初始化#

1
/***********************************************************************
2
* realizeClassWithoutSwift
3
* Performs first-time initialization on class cls,
4
* including allocating its read-write data.
5
* Does not perform any Swift-side initialization.
6
* Returns the real class structure for the class.
7
* Locking: runtimeLock must be write-locked by the caller
8
**********************************************************************/
9
static Class realizeClassWithoutSwift(Class cls, Class previously)
10
{
11
    lockdebug::assert_locked(&runtimeLock.get());
12

13
    class_rw_t *rw;
14
    Class supercls;
15
    Class metacls;
16

17
    if (!cls) return nil;
18
    if (cls->isRealized()) {
19
        validateAlreadyRealizedClass(cls);
20
        return cls;
21
    }
22
    ASSERT(cls == remapClass(cls));
23

24
    // fixme verify class is not in an un-dlopened part of the shared cache?
25

26
    auto ro = cls->safe_ro();
27
    auto isMeta = ro->flags & RO_META;
28
    if (ro->flags & RO_FUTURE) {
29
        // This was a future class. rw data is already allocated.
30
        rw = cls->data();
31
        ro = cls->data()->ro();
32
        ASSERT(!isMeta);
33
        cls->changeInfo(RW_REALIZED|RW_REALIZING, RW_FUTURE);
34
    } else {
35
        // Normal class. Allocate writeable class data.
36
        rw = objc::zalloc<class_rw_t>();
37
        rw->set_ro(ro);
38
        rw->flags = RW_REALIZED|RW_REALIZING|isMeta;
39
        cls->setData(rw);
40
    }
41

42
    cls->cache.initializeToEmptyOrPreoptimizedInDisguise();
43

44
#if FAST_CACHE_META
45
    if (isMeta) cls->cache.setBit(FAST_CACHE_META);
46
#endif
47

48
    // Choose an index for this class.
49
    // Sets cls->instancesRequireRawIsa if indexes no more indexes are available
50
    cls->chooseClassArrayIndex();
51

52
    if (PrintConnecting) {
53
        _objc_inform("CLASS: realizing class '%s'%s %p %p #%u %s%s",
54
                     cls->nameForLogging(), isMeta ? " (meta)" : "",
55
                     (void*)cls, ro, cls->classArrayIndex(),
56
                     cls->isSwiftStable() ? "(swift)" : "",
57
                     cls->isSwiftLegacy() ? "(pre-stable swift)" : "");
58
    }
59

60
    // Realize superclass and metaclass, if they aren't already.
61
    // This needs to be done after RW_REALIZED is set above, for root classes.
62
    // This needs to be done after class index is chosen, for root metaclasses.
63
    // This assumes that none of those classes have Swift contents,
64
    //   or that Swift's initializers have already been called.
65
    //   fixme that assumption will be wrong if we add support
66
    //   for ObjC subclasses of Swift classes.
67
    supercls = realizeClassWithoutSwift(remapClass(cls->getSuperclass()), nil);
68
    metacls = realizeClassWithoutSwift(remapClass(cls->ISA()), nil);
69

70
    // If there's no superclass and this is not a root class, then we have a
71
    // missing weak superclass. Disable the class and return.
72
    if (!supercls && !(cls->safe_ro()->flags & RO_ROOT)) {
73
        if (PrintConnecting)
74
            _objc_inform("CLASS: '%s'%s %p has missing weak superclass, disabling.",
75
                         cls->nameForLogging(), isMeta ? " (meta)" : "", (void *)cls);
76
        addRemappedClass(cls, nil);
77

78
        // Set the metaclass to nil to signal that this class is disabled.
79
        // Root classes have a nil superclass, but all (non-disabled) classes
80
        // have a non-nil isa pointer, so this can be used as a quick check for
81
        // disabled classes.
82
        cls->initIsa(nil);
83

84
        return nil;
85
    }
86

87
#if SUPPORT_NONPOINTER_ISA
88
    if (isMeta) {
89
        // Metaclasses do not need any features from non pointer ISA
90
        // This allows for a faspath for classes in objc_retain/objc_release.
91
        cls->setInstancesRequireRawIsa();
92
    } else {
93
        // Disable non-pointer isa for some classes and/or platforms.
94
        // Set instancesRequireRawIsa.
95
        bool instancesRequireRawIsa = cls->instancesRequireRawIsa();
96
        bool rawIsaIsInherited = false;
97
        static bool hackedDispatch = false;
98
        const char *name;
99

100
        if (DisableNonpointerIsa) {
101
            // Non-pointer isa disabled by environment or app SDK version
102
            instancesRequireRawIsa = true;
103
        }
104
        else if (!hackedDispatch
105
                 && (name = ro->getName()) // Yes, we mean to assign here
106
                 && 0 == strcmp(name, "OS_object"))
107
        {
108
            // hack for libdispatch et al - isa also acts as vtable pointer
109
            hackedDispatch = true;
110
            instancesRequireRawIsa = true;
111
        }
112
        else if (supercls  &&  supercls->getSuperclass()  &&
113
                 supercls->instancesRequireRawIsa())
114
        {
115
            // This is also propagated by addSubclass()
116
            // but nonpointer isa setup needs it earlier.
117
            // Special case: instancesRequireRawIsa does not propagate
118
            // from root class to root metaclass
119
            instancesRequireRawIsa = true;
120
            rawIsaIsInherited = true;
121
        }
122

123
        if (instancesRequireRawIsa) {
124
            cls->setInstancesRequireRawIsaRecursively(rawIsaIsInherited);
125
        }
126
    }
127
// SUPPORT_NONPOINTER_ISA
128
#endif
129

130
    // Update superclass and metaclass in case of remapping
131
    cls->setSuperclass(supercls);
132
    cls->initClassIsa(metacls);
133

134
    // Reconcile instance variable offsets / layout.
135
    // This may reallocate class_ro_t, updating our ro variable.
136
    if (supercls  &&  !isMeta) reconcileInstanceVariables(cls, supercls, ro);
137

138
    // Set fastInstanceSize if it wasn't set already.
139
    cls->setInstanceSize(ro->instanceSize);
140

141
    // Copy some flags from ro to rw
142
    if (ro->flags & RO_HAS_CXX_STRUCTORS) {
143
        cls->setHasCxxDtor();
144
        if (! (ro->flags & RO_HAS_CXX_DTOR_ONLY)) {
145
            cls->setHasCxxCtor();
146
        }
147
    }
148

149
    // Propagate the associated objects forbidden flag from ro or from
150
    // the superclass.
151
    if ((ro->flags & RO_FORBIDS_ASSOCIATED_OBJECTS) ||
152
        (supercls && supercls->forbidsAssociatedObjects()))
153
    {
154
        rw->flags |= RW_FORBIDS_ASSOCIATED_OBJECTS;
155
    }
156

157
    // Connect this class to its superclass's subclass lists
158
    if (supercls) {
159
        addSubclass(supercls, cls);
160
    } else {
161
        addRootClass(cls);
162
    }
163

164
    // Attach categories
165
    methodizeClass(cls, previously);
166

167
    return cls;
168
}

具体做了什么？

分配类的读写空间（rwdata）
初始化缓存
初始化superclass、metaclass
Tagged Pointer优化
rwdata初始化（methodizeClass）、初始化方法表
- 安装 class 自己的方法
- 处理 preoptimized method lists
- root metaclass 处理
- attach categories

对swift类初始化#

实现逻辑在realizeSwiftClass里

1
/***********************************************************************
2
* realizeSwiftClass
3
* Performs first-time initialization on class cls,
4
* including allocating its read-write data,
5
* and any Swift-side initialization.
6
* Returns the real class structure for the class.
7
* Locking: acquires runtimeLock indirectly
8
**********************************************************************/
9
static Class realizeSwiftClass(Class cls)
10
{
11
    lockdebug::assert_unlocked(&runtimeLock.get());
12

13
    // Some assumptions:
14
    // * Metaclasses never have a Swift initializer.
15
    // * Root classes never have a Swift initializer.
16
    //   (These two together avoid initialization order problems at the root.)
17
    // * Unrealized non-Swift classes have no Swift ancestry.
18
    // * Unrealized Swift classes with no initializer have no ancestry that
19
    //   does have the initializer.
20
    //   (These two together mean we don't need to scan superclasses here
21
    //   and we don't need to worry about Swift superclasses inside
22
    //   realizeClassWithoutSwift()).
23

24
    // fixme some of these assumptions will be wrong
25
    // if we add support for ObjC sublasses of Swift classes.
26

27
#if DEBUG
28
    runtimeLock.lock();
29
    ASSERT(remapClass(cls) == cls);
30
    ASSERT(cls->isSwiftStable_ButAllowLegacyForNow());
31
    ASSERT(!cls->isMetaClassMaybeUnrealized());
32
    ASSERT(cls->getSuperclass());
33
    runtimeLock.unlock();
34
#endif
35

36
    // Look for a Swift metadata initialization function
37
    // installed on the class. If it is present we call it.
38
    // That function in turn initializes the Swift metadata,
39
    // prepares the "compiler-generated" ObjC metadata if not
40
    // already present, and calls _objc_realizeSwiftClass() to finish
41
    // our own initialization.
42

43
    if (auto init = cls->swiftMetadataInitializer()) {
44
        if (PrintConnecting) {
45
            _objc_inform("CLASS: calling Swift metadata initializer "
46
                         "for class '%s' (%p)", cls->nameForLogging(), cls);
47
        }
48

49
        Class newcls = init(cls, nil);
50

51
        if (cls != newcls) {
52
            mutex_locker_t lock(runtimeLock);
53
            addRemappedClass(cls, newcls);
54
        }
55

56
        return newcls;
57
    }
58
    else {
59
        // No Swift-side initialization callback.
60
        // Perform our own realization directly.
61
        mutex_locker_t lock(runtimeLock);
62
        return realizeClassWithoutSwift(cls, nil);
63
    }
64
}

实际上会调用swift runtime的初始化函数。

查找IMP#

这一步的逻辑很简单：

1
for (unsigned attempts = unreasonableClassCount();;) {
2
        if (curClass->cache.isConstantOptimizedCache(/* strict */true)) {
3
#if CONFIG_USE_PREOPT_CACHES
4
            imp = cache_getImp(curClass, sel);
5
            if (imp) goto done_unlock;
6
            curClass = curClass->cache.preoptFallbackClass();
7
#endif
8
        } else {
9
            // curClass method list.
10
            method_t *meth = getMethodNoSuper_nolock(curClass, sel);
11
            if (meth) {
12
                imp = meth->imp(false);
13
                goto done;
14
            }
15

16
            if (slowpath((curClass = curClass->getSuperclass()) == nil)) {
17
                // No implementation found, and method resolver didn't help.
18
                // Use forwarding.
19
                imp = forward_imp;
20
                break;
21
            }
22
        }
23

24
        // Halt if there is a cycle in the superclass chain.
25
        if (slowpath(--attempts == 0)) {
26
            _objc_fatal("Memory corruption in class list.");
27
        }
28

29
        // Superclass cache.
30
        imp = cache_getImp(curClass, sel);
31
        if (slowpath(imp == forward_imp)) {
32
            // Found a forward:: entry in a superclass.
33
            // Stop searching, but don't cache yet; call method
34
            // resolver for this class first.
35
            break;
36
        }
37
        if (fastpath(imp)) {
38
            // Found the method in a superclass. Cache it in this class.
39
            goto done;
40
        }
41
    }

isConstantOptimizedCache -> dyld塞的prebuild buckets。如果有，尝试去读取
否则，调用getMethodNoSuper_nolock ，去查方法表

1
/***********************************************************************
2
 * getMethodNoSuper_nolock
3
 * fixme
4
 * Locking: runtimeLock must be read- or write-locked by the caller
5
 **********************************************************************/
6
static method_t *
7
getMethodNoSuper_nolock(Class cls, SEL sel)
8
{
9
    lockdebug::assert_locked(&runtimeLock.get());
10

11
    ASSERT(cls->isRealized());
12
    // fixme nil cls?
13
    // fixme nil sel?
14

15
    auto alternates = cls->data()->methodAlternates();
16

17
    if (auto *relativeList = alternates.relativeList)
18
        return getMethodFromRelativeList(relativeList, sel);
19

20
    if (alternates.list)
21
        return getMethodFromListArray(&alternates.list, 1, sel);
22

23
    if (auto *array = alternates.array) {
24
        auto listAlternates = array->listAlternates();
25
        if (listAlternates.oneList)
26
            return getMethodFromListArray(&listAlternates.oneList, 1, sel);
27
        if (auto innerArray = listAlternates.array)
28
            return getMethodFromListArray(innerArray, listAlternates.arrayCount, sel);
29
        if (auto *relativeList = listAlternates.listList)
30
            return getMethodFromRelativeList(relativeList, sel);
31
    }
32

33
    return nil;
34
}

cls->data() 对应的就是class的rwdata。而methodAlternative的代码如下：

1
// Get the class's method lists without wrapping the different
2
    // representations in a method_array_t. This allows the caller to directly
3
    // access the underlying representations and have separate code for them,
4
    // rather than relying on the iterator abstraction being sufficiently
5
    // optimized. This exists for getMethodNoSuper_nolock to call, other callers
6
    // should be able to just use methods().
7
    ALWAYS_INLINE
8
    MethodListAlternates methodAlternates() const {
9
        MethodListAlternates result = {};
10
        auto v = get_ro_or_rwe();
11
        if (v.is<class_rw_ext_t *>()) {
12
            result.array = &v.get<class_rw_ext_t *>(&ro_or_rw_ext)->methods;
13
        } else {
14
            auto &baseMethods = v.get<const class_ro_t *>(&ro_or_rw_ext)->baseMethods;
15
            result.list = baseMethods.dyn_cast<method_list_t *>();
16
            result.relativeList = baseMethods.dyn_cast<relative_list_list_t<method_list_t> *>();
17
        }
18
        return result;
19
    }

功能为获取class的方法表

写入缓存#

最后，将拿到的IMP写入到cache里，并返回IMP，为了方便下次做查询。

1
done:
2
    if (fastpath((behavior & LOOKUP_NOCACHE) == 0)) {
3
#if CONFIG_USE_PREOPT_CACHES
4
        while (cls->cache.isConstantOptimizedCache(/* strict */true)) {
5
            cls = cls->cache.preoptFallbackClass();
6
        }
7
#endif
8
        log_and_fill_cache(cls, imp, sel, inst, curClass);
9
    }
10
#if CONFIG_USE_PREOPT_CACHES
11
 done_unlock:
12
#endif
13
    runtimeLock.unlock();
14
    if (slowpath((behavior & LOOKUP_NIL) && imp == forward_imp)) {
15
        return nil;
16
    }
17
    return imp;

1
/***********************************************************************
2
* log_and_fill_cache
3
* Log this method call. If the logger permits it, fill the method cache.
4
* cls is the method whose cache should be filled.
5
* implementer is the class that owns the implementation in question.
6
**********************************************************************/
7
static void
8
log_and_fill_cache(Class cls, IMP imp, SEL sel, id receiver, Class implementer)
9
{
10
#if SUPPORT_MESSAGE_LOGGING
11
    if (slowpath(objcMsgLogEnabled && implementer)) {
12
        bool cacheIt = logMessageSend(implementer->isMetaClass(),
13
                                      cls->nameForLogging(),
14
                                      implementer->nameForLogging(),
15
                                      sel);
16
        if (!cacheIt) return;
17
    }
18
#endif
19
    if (slowpath(msgSendCacheMissHook.isSet())) {
20
        auto hook = msgSendCacheMissHook.get();
21
        hook(cls, receiver, sel, imp);
22
    }
23

24
    cls->cache.insert(sel, imp, receiver);
25
}

收尾#

最后，执行栈回到TailCallFunctionPointer x17上，跳转到IMP对应的地址上，完成一次消息调用。

总结#

本文稍微深入得讲了下objc_msgSend的原理。我本来以为，objc_msgSend无非就是一个类似于消息中心一样的东西，没想到Apple针对这个做了很多非常细致的性能优化。

目前为止，Objective-C运行时消息派发的部分已经全部完结了🤮。接下来，终于可以回到Kotlin Native部分。